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[57] ABSTRACT 
Described are methods and means for the construction 
and microbial expression of quasi-synthetic genes aris- 
ing from the combination of organic synthesis and enzy- 
matic reverse transcription from messenger RNA se- 
quences incomplete from the standpoint of the desired 
protein product Preferred products of expression lack 
bic-inactivating leader sequences common in eukary- 
otic expression products but problematic with regard to 
microbial cleavage to yield bioactive material. Illustra- 
tive is a preferred embodiment in which a gene coding 
for human growth hormone (useful in, e.g., treatment of 
hypopituitary dwarfism) is constructed and expressed. 

12 Claims, 5 Drawing Figures 
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mentary sequences, the protruding or "cohesive" ends 

METHOD OF CONSTRUCTING A REPLICABLE can recombine by base pairing. When two different 

CLONING VEHICLE HAVING molecules are cleaved with this enzyme, crosswise pair- 

QUASI-SYNTHETIC GENES ing of the complementary single strands generates a 

5 new DNA molecule, which can be given covalent in- 

BACKGROUND OF THE INVENTION tegrity by using ligase to seal the single strand breaks 

Genetic Expression that remain at the point of annealing. Restriction en- 

T . p.*] * vj.„ w ,~K„„„„i^„ „„;ji „ f ...t.i„i, „„ „ zymes which leave coterminal or "blunt" ends on du- 

The DNA (deoxyribonucleic acid) of which genes ; . . . . , , ., ■ • 

are made comprises both protein-encoding or "struc- ln P kx DNA ,* at has bee " l eav ^ perm " r ecombmatlon 

tural" genes and control regions that mediate the ex- 10 Vla ' e '« ' T4 h S ase wlth other blunt-ended sequences. 

pression of their information through provision of sites CLONING VEHICLES AND RECOMBINANT 

for RNA polymerase binding, information for ribo- DNA 

somal binding sites, etc. Encoded protein is "expressed" 

from its corresponding DNA by a multistep process For present purposes, a "cloning vehicle" is an extra- 

within an organism by which: chromosomal length of duplex DNA comprising an 

1. The enzyme RNA polymerase is activitated in the intact replicon such that the vehicle can be replicated 
control region (hereafter the "promoter") and travels when placed within a unicellular organism ("microbe") 
along the structural gene, transcribing its encoded infer- by transformation. An organism so transformed is 
mation into messenger ribonucleic acid (mRNA) until called a "transformant". Presently, the cloning vehicles 
transcription of translatable mRNA is ended at one or commonly in use are derived from viruses and bacteria 
more "stop" codons. . and most commonly are loops of bacteria DNA called 

2. The mRNA message is translated at the ribosomes "plasmids". 

into a protein for whose amino acid sequence the gene Advances in biochemistry in recent years have led to 
encodes, beginning at a translation "start" signal, most the construction of "recombinant" cloning vehicles in 
commonly ATG (which is transcribed "AUG" and 5 which, for example, plasmids are made to contain exog- 
translated "f-methionine"). enous DNA. In particular instances the recombinant 
In accordance with the genetic code, DNA specifies may include "heterologous" DNA, by which is meant 
each amino acid by a triplet or "codon" of three adja- DNA that codes for polypeptides ordinarily not pro- 
cent nucleotides individually chosen from adenosine, duced by the organism susceptible to transformation by 
thymidine, cytidine and guanine or, as used herein, tne recombinant vehicle. Thus, plasmids are cleaved 
A,T,C, or G. These appear in the coding strand or with restriction enzymes to provide linear DNA having 
coding sequence of double-stranded ("duplex") DNA, ii gata ble termini. These ar bound to an exogenous gene 
whose remaining or "complementary" strand is formed having u^Me termini to provide a biologically func- 
of nucleotides/ 'bases ) which hydrogen bond to their tional mdet with m ktact Ucon and a phenotypical 
complements in the coding strand. A complements T, rt usefm ^ Sfilectin transformants . The recom . 
and^C complements G. These and other subjects relat- binan{ moiet is inserted intQ a micr00rganism by trans . 
mg to tne background ot tne invention are discussed a formation md the transformant 

is isolated and cloned, 

^^^^^^sr£ the obje i s obtaining large popu r ons r r 

„*t,„ - ui:„„+:„ ii a a * u • • * v an elude copies of the exogenous gene and, in particular 

other publications alluded to herein are incorporated by w , u- ^ r ■ *if 

reference cases, with the further object of expressing the protein 

for which the gene codes. The associated technology 
DNS CLEAVAGE AND LIGATION and its potential applications are reviewed in extenso in 
A variety of techniques are available for DNA re- the m™ional Symposium Series 10: Recombi- 
combination, according to which adjoining ends of « ^ ^h^Inv^on Sc^^a^Soe^ Bern and 
separate DNA fragments are tailored in one way or Bosseff ' eds " Raven Press ' N ' Y - < 1977 )' 
another to facilitate ligation. The latter term refers to RECOMBINANT DNA EXPRESSION 
the formation of phosphodiester bonds between adjoin- , , * , . .-, 
ing nucleotides, most often through the agency of the Aslde from ±e use of clomtl g vehicles to increase the 
enzyme T4 DNA ligase. Thus, blunt ends may be di- 50 supply of genes by replication, there have been at- 
rectly ligated. Alternatively, fragments containing com- tempts, some successful, to actually express proteins for 
plementary single strands at their adjoining ends are whlch the S enes code ' In the flrst such instance a gene 
advantaged by hydrogen bonding which positions the for the brain hormone somatostation under the influ- 
respective ends for subsequent ligation. Such single ence of the lac promotor was expressed in E. Coli bac- 
strands, referred to as cohesive termini, may be formed 55 teria. K. Itakura et al„ Science 198, 1056 (1977). More 
by the addition of nucleotides to blunt ends using termi- recently, the A and B chains of human insulin were 
nal transferase, and sometimes simply by chewing back expressed in the same fashion and combined to form the 
one strand of a blunt end with an enzyme such X-exonu- hormone. D. V. Goeddel et al., Proc. Natl. Acad. ScL, 
clease. Again, and most commonly, resort may be had U.S.A. 76, 106 (1979). In each case the genes were con- 
to restriction endonucleases (hereafter, "restriction en- 60 structed in their entirety by synthesis. In each case, 
zymes"), which cleave phosphodiester bonds in and proteolytic enzymes within the cell would apparently 
around unique sequences of nucleotides of about 4-6 degrade the desired product, necessitating its produc- 
base pairs in length ("restriction sites"). Many restric- tion in conjugated form, i.e., in tandem with another 
tion enzymes and their recognition sites are known. See, protein which protected it by compartmentalization and 
e.g., R. J. Roberts, CRC Critical Reviews in Biochemis- 65 which could be extracellulary cleaved away to yield the 
try, 123 (November 1976). Many make staggered cuts product intended. This work is described in the follow- 
that generate short complementary single-stranded se- ing published British patent specifications of the as- 
quences at the ends of the duplex fragments. As comple- signee of the present application: GB No. 2,007,675 A; 
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ZW,mT' m A; ° B N °' 2,007,676 A; an<1 ° B N °' SUMMARY OF THE INVENTION 

While the synthetic gene approach has proven useful The present invention provides methods and means 
in the several cases thus far discussed, real difficulties for expressing quasi-synthetic genes wherein reverse 
arise in the case of far larger protein products, e.g., 5 transcription provides a substantial portion, preferably a 
growth hormone, interferon, etc., whose genes are cor- majority, of the coding sequence without laborious 
respondingly more complex and less susceptible to fac- resort to entirely synthetic construction, while synthesis 
ile synthesis. At the same time, it would be desirable to of the remainder of the coding sequence affords a corn- 
express such products unaccompanied by conjugate pleted gene capable of expressing the desired polypep- 
protein, the necessity of whose expression requires di- 10 tide unaccompanied by bio-inactivating leader sequen- 
version of resources within the organism better commit- ces or other extraneous protein. Alternatively, the syn- 
ted to construction of the intended product. thetic remainder may yield a proteolysis-resistant con- 
Other workers have attempted to express genes de- j u 8 ate so engineered as to permit extra-cellular cleavage 
rived not by organic synthesis but rather by reverse of extraneous protein, yielding the bioactive form. The 
transcription from the corresponding messenger RNA 15 invention accordingly makes available methods and 
purified from tissue. Two problems have attended this means for microbial production of numerous materials 
approach. To begin with, reverse transcriptase may mthert ? Produced only in limited quantity by costly 
stop transcription from mRNA short of completing extraction from tissue, and still others previously inca- 
cDNA for the entire amino acid sequence desired. pable of industrial manufacture. In its most preferred 
Thus, for example, Villa-Komaroff et al obtained 20 embodiment the invention represents the first occasion 
cDNA for rat proinsulin which lacked codons for the J? which a ™dically significant polypeptide hormone 
first three amino acids of the insulin precursor. Proc. 0 1 ™ S™ wth hormone) has been bactenally ex- 
Natl Acad. Set. U.S.A. 75 3727 (1978) Again, reverse P 1 "^ whlle .f 0,d,ng b °* »ntraceUular proteolysis 
transcription of mRNA for polypeptides that are ex- „ f d the ne t cessitv of compartmentalizing the bioactive 
pressed in precursor form hL yielded cDNA for the 25 form m extraneous protem pending extraceUular cleav- 
r i.t. J. 4.u f * • it. * a S e - Microbial sources for human growth hormone 
precursor form rather than the bioactive protem that ^ ^ S for 

results when, in a eukaryotic cell, leader sequences are Je Ues hormone fof treatment of hypopi . 

enzymatically removed. Thus far, no bacteria cell has tl / d ^ arf . hef ^ Qther a u ^ tio J nere . 

been shown to share that capability so that mRNA 3Q tofore ' beyond the ,f apacitv 0 f tissue-derived hormone 

transcripts have yielded expression products containing sources> mcludin ^ bleedi ^^0- 

tfae leader sequences of the precursor form rather than ^ bum th wound healin d str h md bone 

the bioactive protein itself. Villa-Komaroff, supra (rat knitting 

proinsulin); P. H. Seeburg et al., Nature 276, 795 (1978) ^ manner m which these ^ Qther objects md 

(rat pregrowth hormone). 35 advantages of the invention may be obtained will ap- 

Finally, past attempts by others to bactenally express pear more fully from the detailed description which 

hormones (or their precursors) from mRNA transcripts follows, and from the accompanying drawings relating 

have on occasion led only to the production of conju- to a preferred embodiment of the invention, in which: 

gated proteins not apparently amenable to extra-cellular FIG. 1 depicts the synthetic scheme for construction 

cleavage, e.g., Villa-Komaroff, supra, (penicillinase- 40 of a gene fragment coding for the first 24 amino acids of 

proinsulin); Seeburg, supra (beta-lactamase-pregrowth human gr OWt h hormone, together with the start signal 

hormone). ^jq and lingers use( i m cloning. The arrows in the 

HUMAN GROWTH HORMONE coding or upper strand ("U") and in the complementary 

or lower strands ("L") indicate the oligonucleotides 

Human growth hormone ("HGH") is secreted in the 45 j omed to form the depicted fragment; 

human pituitary. It consists of 191 amino acids and, with piG. 2 depicts joinder of the "U" and "L" oligonu- 

its molecular weight of about 21,500, is more than three cleotides to form the gene fragment of FIG. 1, and its 

times as large as insulin. Until the present invention, insertion in a plasmid cloning vehicle; 

human growth hormone could be obtained only by FIG. 3 illustrates the DNA sequence (coding strand 

laborious extraction from a limited source— the pitui- 50 only) of the Hae III restriction enzyme fragment of a 

tary glands of human cadavers. The consequent scarcity pituitary mRNA transcript, with the numbered amino 

of the substance has limited its applications to the treat- acids of human growth hormone for which they code, 

ment of hypopituitary dwarfism, and even here reliable Key restriction sites are indicated, as is DNA (following 

estimates suggest that human-derived HGH is available "stop") for untranslated mRNA; 

in sufficient quantity to serve not more than about 50% 55 FIG. 4 illustrates the construction of a cloning vehi- 

of afflicted subjects. cle for a gene fragment coding for the amino acids of 

In summary, a need has existed for new methods of human growth hormone not synthetically derived, and 

producing HGH and other polypeptide products in the construction of that gene fragment as complemen- 

quantity, and that need has been particularly acute in tary DNA by reverse transcription from mRNA iso- 

the case of polypeptides too large to admit of organic 60 lated from a human pituitary source; and 

synthesis or convenient synthesis of genes from which FIG. 5 illustrates the construction of a plasmid capa- 

the peptide could be expressed. Expression of mamma- ble, in bacteria, of expressing human growth hormone, 

lian hormones from mRNA transcripts has offered the beginning with the plasmids of FIGS. 2 and 4. 

promise of side-stepping difficulties that attend the syn- typtatt fti nF<?rRTPTTnNT of thf 

thetic approach, but until the present has permitted only 65 DETAILED ^C^rTlON OF THE 

microbial production of bio-inactive conjugates from lNVfcJNlHJJN 

which the desired hormone could not practicably be The general approach of the invention involves the 

cleaved. combination in a single cloning vehicle of plural gene 
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fragments which in combination code for expression of 
the desired product. Of these, at least one is a cDNA 

fragment derived by reverse transcription from mRNA l^. a yl^ fe .J / 

isolated from tissue, as by the method of A. Ullrich et a ^P* l^- 

al., Science 196, 1313 (1977). The cDNA provides a 5 ttaagccctgatcgt...^ 

substantial portion, and preferably at least a majority, of aattcgggactagca... 

the codons for the desired product, while remaining 3 . L-> d vX^ L •sl^' 

portions of the gene are supplied synthetically. The ^ < \ 
synthetic and mRNA transcript fragments are cloned 

separately to provide ample quantities for use in the where the endpoint of digestion is indicated by arrow, 

later combination step. reaction conditions for exonwclease digestion may be 

A variety of considerations influence distribution of chosen to remove the upper sequences "a" and "b", 

codons for the end product as between synthetic and where after SI nuclease digestion will automatically 

cDNA, most particularly the DNA sequence of com- 15 u m fa , seauences "c" and "d" Alterna- 

plementary DNA determined as by the method of e ™ ate me lower sequences c and a . Alterna- 

Maxam and Gilbert, Proc. Natl Acad. Scl U.S.A. 74, 560 tlvel y and more P recl sely, one may employ DNA poly- 

(1977). Complementary DNA obtained by reverse tran- nierase digestion in the presence of deoxynucleotide 

scription will invariably contain codons for at least a triphosphates ("d(A,T,C,G)TP"). Thus, in the forego- 

carboxy terminal portion of the desired product, as well 20 ing example, DNA polymerase in the presence of 

as other codons for untranslated mRNA downstream dGTP will remove sequence "c" (then stop at "G"), SI 

from the translation stop signal(s) adjacent the carboxy nuclease will then digest "a"; DNA polymerase in the 

terminus. The presence of DNA for untranslated RNA nce of dTTp wm remove „ d „ (then a( ^ 

is largely irrelevant, although unduly lengthy sequences , , .„ „ . .„ ,, , _ 

„f+i,of w„j m „„k„,,> Z.a v., »„„*2»*i «- 7s and SI nuclease will then excise b", and so on. See 

ot that Kind may be removed, as by restriction enzyme 

cleavage, to conserve cellular resources employed in generally A. Romberg, DNA Synthesis, pp. 87-88, W. 
replicating and expressing the DNA for the intended H. Freeman and Co., San Francisco (1974). 
product. In particular cases, the cDNA will contain More preferably, one may simply construct a restric- 
codons for the entire amino acid sequence desired, as tion site at a convenient point within the portion of the 
well as extraneous condons upstream from the amino cDNA coding for the desired product, by an applica- 
terminus of the intended product. For example, many if tion of the mismatch repair synthes i s technique of A. 
not all polypeptide hormones are expressed in precursor _ . t . „ .,, . , _ . „ ^.-.^o vm-,o\ 
f i i - i r • Razm et al., Proc. Natl Acad. Set. USA 75, 4268 (1978). 
form with leader or signal sequences of protein in- ' , . , . v . 
volved, e.g., in transport to the cellular membrane. In By this technique one or more bases may be substituted 
expression from eukaryotic cells, these sequences are ln an existing DNA sequence, using primers containing 
enzymatically removed, such that the hormone enters the mismatched substituent. At least seven palindromic 
the periplasmic space in its freey bioactive form. How- 4-base pair sequences are uniquely recognized by 
ever, microbial cells cannot be relied upon to perform known restriction enzymes, i.e., AGCT (Alu I), CCGG 
that function, and it is accordingly desirable to remove 40 (Hpa II), CGCG (Tha I), GATC (Sau 3A), GCGC 
sequences coding for such signals or leader sequences ^ GGCC ^ m), and TCGA (Taq I). Where the 
from the mRNA transcript. In the course of that re- \^xt » ■ r 
moval process the translation start signal is also lost, and cDNA se " uence contams a se< J uence dlfferm S from one 
almost invariably some codons for the intended product such slte in a sm S le base ' as statistically is highly likely, 
will be removed as well. The synthetic component of 45 repair synthesis will yield replicate cDNA containing 
the quasi-synthetic gene product of the invention re- the proper, substituent base and hence the desired re- 
turns these latter codons, as well as supplying anew a striction site. Cleavage will delete DNA for the un- 
translation start signal where the vehicle into which the wanted leader, after which synthesis will replace co- 
hybrid gene will ultimately be deployed itself lacks a q dons uifed for sion of the comp i e te polypep- 
properly positioned start. 3V 

Elimination of the leader sequence from pregrowth ' 
hormone cDNA is advantaged by the availability of a 

restriction site within the growth hormone-encoding codons for 

portion of the gene. The invention may nevertheless be „ 1^—— leader — ^ desired product ^ 

practiced without regard to the availability of such a ' 
site, or in any event without regard to the availability of 
a restriction site sufficiently near the amino terminus of 
the desired polypeptide as to obviate the need for exten- 
sive synthesis of the gene component not derived from I 
mRNA. Thus, in any cDNA coding for the desired mismatch repair synthesis 
polypeptide and a leader or other bioinactivating se- I 
quence the boundary between the latter's codons and ▼ 
those of the mature polypeptide will appear from the 
amino acid sequence of the mature polypeptide. One 65 

may simply digest into the gene coding of the peptide of HpaI1 
choice, removing the unwanted leader or other se- 
quence. Thus, for example, given cDNA such as: V 



• CCGG 
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It will be appreciated, of course, that longer restriction 
sites may be likewise intended where desired, or that 
successive repairs may create 4-base pair restriction 
sites where only two bases common to the site appear at 15 
the desired point, etc. 

Applications will appear in which it is desirable to 
express not only the amino acid sequence of the in- 
tended product, but also a measure of extraneous but 
specifically engineered protein. Four such applications 2 rj 
may be mentioned by way of example. First, the quasi- 
synthetic gene may represent a hepten or other immu- 
nological determinant upon which immunogenicity is 
conferred by conjugation to additional protein, such 
that vaccines are produced. See generally, G. B. patent 25 
specification No. 2,008, 123 A. Again, it may be desirable 
for bio-safety reasons to express the intended product as 
a conjugate with other, bio-inactivating protein so de- 
signed as to permit extracellular cleavage to yield the 
active form. Third, applications will be presented in 30 
which transport signal polypeptides will precede the 
desired product, to permit production of the same by 
excretion through the cell membrane, so long as the 
signal peptide can then be cleaved. Finally, extraneous 
conjugate designed to permit specific cleavage extracel- 35 
lularly may be employed to compartmentalize intended 
products otherwise susceptible to degradation by pro- 
teases endogenous to the microbial host. At least in the 
latter three applications, the synthetic adaptor molecu- 
lar employed to complete the coding sequence of the 40 
mRNA transcript can additionally incorporate codons 
for amino acid sequences specifically cleavable, as by 
enzymatic action. For example, trypsin will cleave spe- 
cifically at arg-arg or lys-lys, etc. See GB 2 008 123 A, 
supra. 45 

From the foregoing, it will be seen that in its broadest 
aspect the invention admits of manifold applications, 
each having in common these attributes: 

a mRNA transcript is employed which codes for a 
substantial portion of the intended polypeptide's amino 50 
acid sequence but which, if expressed alone, would 
produce a different polypeptide either smaller or larger 
than the intended product; 

protein-encoding codons for amino acid sequences 
other than those contained in the intended product, if 55 
any, are removed; 

organic synthesis yields fragments) coding for the 
remainder of the desired sequence; and 

the mRNA transcript and synthetic fragments) are 
combined and disposed in a promoter-containing clon- 60 
ing vehicle for replication and expression of either the 
intended product absent extraneous conjugated protein, 
or intended product conjugated to but specifically 
cleavable from extraneous protein. 

Of course, the expression product will in every case 65 
commence with the amino acid coded for by the trans- 
lation start signal (in the case of ATG, f-methionine). 
One can expect this to be removed intracellularly, or in 



any event to leave the bioactivity of the ultimate prod- 
uct essentially unaffected. 

Although it provides a method of general applicabil- 
ity in the production of useful proteins, including anti- 
bodies, enzymes and the like, the invention is particu- 
larly suited to the expression of mammalian polypeptide 
hormones and other substances having medical applica- 
tions, e.g., glucagon, gastrointestinal inhibitory poly- 
peptide, pancreatic polypeptide, adrenocorticotropin, 
beta-endorphins, interferon, urokinase, blood clotting 
factors, human albumin, and so on. A preferred embodi- 
ment illustrative of the invention is next discussed, in 
which a quasi-synthetic gene coding for human growth 
hormone is constructed, cloned and microbially ex- 
pressed. 

CONSTRUCTION AND EXPRESSION OF A 
CLONING VEHICLE FOR HUMAN GROWTH 
HORMONE 

1. Cloning the Hae III fragment of the mRNA 
transcript (FIGS. 3 and 4) 

Polyadenylated mRNA for human growth hormone 
(HGH) was prepared from pituitary growth hormone- 
producing tissue by the procedure of A. Ullrich et al. 
Science 196, 1313 (1977) 1.5 /xg of double strand ("ds") 
cDNA was prepared from 5 \ig of this RNA essentially 
as described by Wickens et al. J. Biol Chem. 253 2483 
(1978), except that RNA polymerase "Klenow frag- 
ment", H. Klenow,Pwc. Nat'l Aci. USA. 65, 168 (1970), 
was substituted for DNA Polymerase I in the second 
strand synthesis. The restriction pattern of HGH is such 
that Hae III restriction sites are present in the 3' non- 
coding region and in the sequence coding for amino 
acids 23 and 24 of HGH, as shown in FIG. 3. Treatment 
of ds HGH cDNA with Hae III gives a DNA fragment 
of 551 base pairs ("bp") coding for amino acids 24-191 
of HGH. Thus, 90 ng of the cDNA was treated with 
Hae III, electrophoresed on an 8% polyacryclamide 
gel, and the region at 550 bp eluted. Approximately 1 ng 
of cDNA was obtained. 

pBR322 prepared as in F. Bolivar et al., Gene 2 (1977) 
95-113 was chosen as the cloning vehicle for the 
cDNA. pBR322 has been fully characterized, J. G. 
Sutcliffe, Cold Spring Harbor Symposium 43, 70 (1978), 
is a multicopy replicating plasmid which exhibits both 
ampicillin and tetracycline resistance owing to its inclu- 
sion of the corresponding genes ("Ap^" and "To"", 
respectively, in FIG. 4), and which contains recognition 
sites for the restriction enzymes Pst I, EcoRI and Hind 
III as shown in the Figure. 

The GC tailing method of Chang, A. C. Y. et al. 
Nature 275 617 (1978) was employed to combine the 
products of Pst I cleavage of pBR322 and of Hae III 
digestion of the mRNA transcript, inserting the cDNA 
fragment into the Pst I site of pBR322 in such manner as 
to restore the Hae III restriction sites (GG | CC) on the 
cDNA while restoring the Pst I restriction sites 
(CTGCA I G) at each end of the insert. 

Thus, terminal deoxynucleotidyl transferase (TdT) 
was used to add approximately 20 dC residues per 3' 
terminus as described previously, Chang, A. Y. C, 
supra. 60 ng of Pst I-treated pBR322 was tailed similarly 
with about 10 dG residues per 3' terminus. Annealing of 
the dC-tailed ds cDNA with the dG-tailed vector DNA 
was performed in 130 ul of 10 mM Tris-HCl (pH 7.5), 
100 mM NaCl, 0.25 mM EDTA. The mixture was 
heated to 70° C, allowed to cool slowly to 37° C. (12 
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hours), then to 20° C. (6 hours) before being used to 
transform E. Coli. X 1776. DNA sequence analysis of 
the plasmid pHGH31 cloned in X 1776 by the method 
of Maxam and Gilbert, Proc. Nat'l. Acad. Sci. USA 74, 
560 (1977) resulted in confirmation of the codons for 5 
amino acids 24—191 of HGH, as shown in FIG. 3. 

E Coli K-12 strain X 1776 has the genotype F~ to- 
nA53 dapD8 minAI supE42 A40[gal-uvrB] \- minB2 
rfb-2 nalA25 oms-2 thyA57* metG65 oms-1 A29[bioH- 
asd] cycB2 cycAl hsdR2. X 1776 has been certified by 10 
the National Institutes of Health as an EK2 host vector 
system. 

X1776 has an. obligate requirement for diaminopi- 
melic acid (DAP) and cannot synthesize the mucopoly- 
saccharide colanic acid. It thus undergoes DAP-less 15 
death in all environments where DAP is limiting but 
sufficient nutrients exist to support cellular metabolism 
and growth. It requires thymine or thymidine and un- 
dergoes thymineless death with degradation of DNA 
when thymine and thymidine are absent from the envi- 20 
ronment but when sufficient nutrients are present to 
sustain metabolic activity. X 1776 is extremely sensitive 
to bile and thus is unable to survive and thus is unable to 
survive passage through the intestinal tract of rats. 
X 1776 is extremely sensitive to detergents, antibiotics, 25 
drugs and chemicals. X 1776 is unable to carry out ei- 
ther dark or photo repair of UV-induced damage and is 
thus several orders of magnitude more sensitive to sun- 
light than wild-type strains of E. Coli. X 1776 is resistant 
to many transducing phages and is conjugation deficient 30 
for inheritance of many different types of conjugative 
plasmids due to the presence of various mutations. 
X 1776 is resistant to nalidixic acid, cycloserine and 
trimethoprim. These drugs can therefore be added to 
media to permit monitoring of the strain and to preclude 35 
transformation of contaminants during transformation. 

X 1776 grows with a generation time of about 50 min. 
in either L broth or Penassay broth when supplemented 
with 100 fig DAP/ml and 4 /u,g thymidine/ml and 
reaches final densities of 8-10 X 10 8 cells/ml at station- 
ary phase. Gentle agitation by swirling and shaking 
back and forth for a period of 1-2 min. adequately sus- 
pends cells with maintenance of 100% viability. Addi- 
tional details concerning X 1776 appear in R. Curtis et 
al., Molecular Cloning of Recombinant DNA, 99-177, 
Scott and Werner, eds., Academic Press (New York 
1977). 

2. Construction and Cloning of the Synthetic Gene 
Fragment (FIGS. 1 and 2) 

The strategy for construction of the HGH quasisyn- 
thetic gene included construction of a synthetic frag- 
ment comprising a blunt-end restriction cleavage site 
adjacent the point at which the fragment would be 
joined to the mRNA transcript. Thus, as shown in FIG. 
1, the synthetic gene for the first 24 amino acids of 
HGH contained a Hae III cleavage site following amino 
acid 23. The distal end of the synthetic fragment was 
provided with a "linker" that permitted annealing to a 
single strand terminal resulting from restriction cleav- 
age in the plasmid in which the mRNA transcript and 
synthetic fragment would ultimately be joined. 

As shown in FIG. 1, the 5' ends of the duplex frag- 
ment have single stranded cohesive termini for the Eco 
RI and Hind III restriction endonucleases to facilitate 
plasmid construction. The methionine codon at the left 
end provides a site for initiation of translation. Twelve 
different oligonucleotides, varying in size from un- 
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decamer to hexadecamer, were synthesized by the im- 
proved phosphotriester method of Crea, R. Proc. Nat'l. 
Acad. Sci. USA 75, 5765 (1978). These oligonucleotides, 
Ui to U6 and Li to L6, are indicated by arrows. 

10 jag amounts of U2 through Ue and L2 through L6 
were phosphorylated using T4 polynucleotide kinase 
and (y 32 -P)ATP by a published procedure. Goeddel, D. 
V. et al. Proc. Nafl. Acad. Sci. USA 76, 106 (1979). 

Three separate T4 ligase catalyzed reactions were 
performed: 10 fig of 5'-OH fragment Ui was combined 
with the phosphorylated U2, L5 and L6; phosphorylated 
U3, U4, L3 and L4 were combined; and 10 jig of 5'-OH 
fragment Li was combined with the phosphorylated L2, 
U5 and U6- These ligations were carried out at 4° C. for 
6 hours in 300 /xl of 20 mM Tris-HCl (pH 7.5), 10 mM 
MgCh, 10 mM dithiothreitol, 0.5 mM ATP using 20 
units of T4 ligase. The three ligation mixtures were then 
combined, 20 units T4 ligase added, and the reaction 
allowed to proceed for 12 hours at 20° C. The mixture 
was ethanol precipitated and electrophoresed on a 10% 
polyacrylamide gel. The band migrating at 84 base pairs 
was sliced from the gel and eluted. pBR322 (1 jug) was 
treated with Eco RI and Hind III, the large fragment 
isolated by gel electrophoresis and ligated to the syn- 
thetic DNA. This mixture was used to transform E. 
Coli. K-12 strain 294 (end A, thi~, hsr~, hsm,t+). Strain 
294 was deposited Oct. 30, 1978 in the American Type 
Culture Collection (ATCC No. 31446), without restric- 
tion. Sequence analysis by the Maxam and Gilbert tech- 
nique, supra, on the Eco RI-Hind III insert from a plas- 
mid pHGH3 of one transformant confirmed that de- 
picted in FIG. 1. 

3. CONSTRUCTION OF PLASMID FOR THE 
BACTERIAL EXPRESSION OF HGH (FIG. 5) 

With the synthetic fragment in pHGH3 and the 
mRNA transcript in pHGH31, a replicable plasmid 
containing both fragments was constructed using the 
expression plasmid pGH6, as shown in FIG. 5. The 
expression plasmid, which contains tandem lac promot- 
ers, was first constructed as follows. A 285 base pair 
Eco RI fragment containing two 95 base pair UV5 lac 
promoter fragments separated by a 95 base pair heterlo- 
gous DNA fragent was isolated from plasmid pKB268, 
K. Backman, et al., Cell, Vol. 13, 65-71 (1978). The 285 
bp fragment was inserted into the Eco RI site of 
pBR322 and a clone pGHl isolated with the promoters 
oriented toward and in proper reading phase with the 
gene for tetracycline resistance. The Eco RI site distal 
to the latter gene was destroyed by partial Eco RI di- 
gestion, repair of the resulting single stranded Eco RI 
ends with DNA polymerase I and recircularization of 
the plasmid by blunt-end ligation. The resulting plas- 
mid, pGH6, contains a single Eco RI site properly posi- 
tioned with respect to the promoter system into which 
the completed gene for HGH could be inserted. 

To ready the synthetic fragment for combination 
with the RNA transcript, 10 jag of pHGH3 was cleaved 
with Eco RI and Hae III restriction endonucleases and 
the 77 base pair fragment containing coding sequences 
for HGH amino acids 1-23 was isolated from an 8% 
polyacrylamide gel. 

The plasmid pHGH 31 (5 jag) was next cleaved with 
Hae III. The 551 bp HGH sequence and a comigrating 
540 bp Hae III fragment of pBR322 were purified by gel 
electrophoresis. Subsequent treatment with Xma I 
cleaved only the HGH sequence, removing 39 base 
pairs from the 3' noncoding region. The resulting 512 bp 
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fragment was separated from the 540 bp pBR322 Hae 
III piece by electrophoresis on a 6% polyacrylamide 
gel. 0.3 u-g of the 77 bp Eco Rl-Hae III fragment was 
polymerized with T4 ligase in a 16 ill reaction vessel for 
14 hours at 4° C. The mixture was heated to 70° C. for 5 
5' to inactivate the ligase, then treated with Eco RI (to 
cleave fragments which had dimerized through their 
Eco RI sites) and with Sma I (to cleave Xma I dimers), 
yielding a 591 bp fragment with an Eco RI "cohesive" 
end and a Sma I "blunt" end. After purification on a 6% 10 
polyacrylamide gel, approximately 30 ng of this frag- 
ment were obtained. It should be noted that the expres- 
sion plasmid pGH6 contains no Xma I recognition site. 
However, Sma I recognizes the same site as Xma I, but 
cuts through the middle of it, yielding blunt ends. The 15 
Sma-cleaved terminus of the fragment derived from 
gHGH31 can accordingly be blunt end ligated into 
pGH6. 

The expression plasmid pGH6, containing tandem lac 
UV5 promoters, was treated successively with Hind III, 20 
nuclease SI, and Eco RI and purified by gel electropho- 
resis. 50 ng of the resulting vector, which had one Eco 
RI cohesive end and one blunt end was ligated to 10 ng 
of the 591 bp HGH DNA. The ligation mixture was 
used to transform E. Coll X1776. Colonies were se- 25 
lected for growth on tetracycline (12.5 jxg/ml). It is 
noteworthy that insertion of the hybrid HGH gene into 
pGH6 destroys the promoter for the tetracycline resis- 
tance gene, but that the tandem lac promoter permits 
read-through of the structural gene for tet resistance, 30 
retaining this selection characteristic. Approximately 
400 transformants were obtained. Filter hybridization 
by the Grunstein-Hogness procedure, Proc. Nafl. Acad. 
Sci. USA, 72 3961 (1975) identified 12 colonies contain- 
ing HGH sequences. The plasmids isolated from three 35 
of these colonies gave the expected restriction patterns 
when cleaved with Hae III, Pvu II, and Pst I. The DNA 
sequence of one clone, pHGH107, was determined. 

Human growth hormone expressed by the transfor- 
mants was easily detected by direct radioimmunoassay 40 
performed on serial dilutions of lysed cell supernatants 
using the Phadebas HGH PRIST kit (Pharmacia). 

To demonstrate that HGH expression is under the 
control of the lac promoter, pHGH107 was transformed 
into E. coli strain D1210 a lac+(i20+z+y+), a lac 45 
repressor overproducer. Meaningful levels of HGH 
expression could not be detected until addition of the 
inducer IPTG (isopropylthiogalactoside). 

Removal of the Eco RI site in pHGH107 would leave 
the ATG start signal the same distance from the ribo- 50 
some binding site condons of the lac promoter as occurs 
in nature between those condons and the start signal for 
B-galaclosidase. To determine whether expression 
would be increased by mimicking this natural spacing 
we converted pHGH107 to pHGH107-l by opening the 55 
former with Eco RI, digesting the resulting single 
strand ends with SI endonuclease, and recircularizing 
by blunt-end ligation with T4 ligase. Although the re- 
sulting plasmid proved likewise capable of expressing 
HGH, it surprisingly did so to a lesser extent that did 60 
pGH107, as shown by direct radioimmunoassay. 

It will be apparent to those skilled in the art that the 
present invention is not limited to the preferred embodi- 
ment just discussed, but rather only to the lawful scope 
of the appended claims. Variations other than those 65 
hitherto discussed will be apparent, whether in the 
choice of promoter system, parental plasmid, intended 
polypeptide product or elsewhere. For example, other 
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promoter systems applicable to the present invention 
include the lambda promoter, the arabinose operon (phi 
80 d ara) or the colicin El, galactose, alkaline phospha- 
tase or tryptophan promoter systems. Host organisms 
for bacterial expression may be chosen, e.g., from 
among the Enterobacteriaceae, such as strains of Esch- 
erichia coli and Salmonella; Bacillaceae, such as bacillus 
subtilis; Pneumococcus; Streptococcus; and Haemoph- 
ilus influenzae. Of course, the choice of organism will 
control the levels of physical containment in cloning 
and expression that should be practiced to comply with 
National Institutes of Health Guidelines for Recombi- 
nant DNA, 43 Fed. Reg. 60,080 (1978). 

While preferred for bench-scale practice of the pres- 
ent invention, E. Coli. X 1776 could prove of limited 
practicality in large-scale industrial manufacture owing 
to the debilitations purposefully incorporated in it for 
biosafety reasons. With appropriate levels of physical, 
rather than biological, containment such organisms as 
E. coli. K-12 strain 294, supra, and E. colt strain RR1, 
genotype: Pro-Leu~Thi-Rs — recA+Str' - Lac y- 
could be employed in larger scale operation. E. coli 
RR1 is derived from E. coli. HB101 (H. W. Boyer, et al., 
/. Mol. Bio. (1969) 41 459-472) by mating with E coli. 
K12 strain KL16 as the Hfr donor. See J. H. Miller, 
Experiments in Molecular Genetics (Cold Spring Harbor, 
N.Y., 1972). A culture of E. coll RR1 was deposited 
Oct. 30, 1978 with the American Type Culture Collec- 
tion, without restriction as to access (ATCC No. 
31343). A culture of X1776 was similarly deposited 
July 3, 1979 in the American Type Culture Collection 
(ATCC No. 31537). Deposits of the following were 
made in the American Type Culture Collection July 3, 
1979: plasmid pHGH107 (ATCC No. 40011); plasmid 
pGH6 (ATCC No. 40012); strain X 1776 transformed 
with pHGH107 (ATCC No. 31538) and E. Coli K12 
strain 294 transformed with pGH6 (ATCC No. 31539). 

Organisms produced according to the invention may 
be employed in industrial scale fermentative production 
of human growth hormone, yielding product in quanti- 
ties and for applications hitherto unattainable. For ex- 
ample, transformant K coli cultures may be grown up in 
aqueous media in a steel or other fermentation vessel 
conventionally aerated and agitated, in aqueous media 
at, e.g., about 37° C. and near neutral pH (e.g., pH 
7+0.3) supplied with appropriate nutriments such as 
carbohydrate or glycerol, nitrogen sources such as am- 
monium sulfate, potassium sources such as potassium 
phosphate, trace elements, magnesium sulfate and the 
like. Transformant organisms preferably exhibit one or 
more selection characteristics, such as antibiotic resis- 
tance, so that selection pressures may be imposed to 
discourage competitive growth of wild-type K coli. As 
an example, in the case of an ampicillin or tetracycline- 
resistant organism the antibiotic may be added to the 
fermentation medium to select out wild-type organisms 
which lack the resistance characteristic. 

Upon completion of fermentation the bacterial sus- 
pension is centrifuged or the cellular solids otherwise 
collected from the broth and then lysed by physical or 
chemical means. Cellular debris is removed from super- 
natant and soluble growth hormone isolated and puri- 
fied. 

Human growth hormone may be purified from bacte- 
rial extracts using one or a combination of (1) polyethyl- 
eneimine fractionation; (2) gel filtration chromatogra- 
phy on Sephacryl S-200; (3) ion exchange chromatogra- 
phy on Biorex-70 resin or CM Sephadex; (4) ammonium 
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sulphate and/or pH fractionation; and (5) affinity chro- 
matography using antibody resins prepared from anti 
-HGH IgG isolated from immunosensitized animals or 
hybridomas; and desorbed under acid or slightly dena- 
turing conditions. S 
We claim: 

1. In the method of constructing a replicable cloning 
vehicle capable, in a microbial organism, of expressing a 
particular polypeptide of known amino acid sequence 
wherein a gene coding for the polypeptide is inserted 10 
into a cloning vehicle and placed under the control of 
an expression promoter, the improvement which com- 
prises: 

(a) obtaining by reverse transcription from messenger 
RNA a first gene fragment for an expression prod- 15 
uct other than said polypeptide, which fragment 
comprises at least a portion of the coding sequence 
for said polypeptide; 

(b) where the first fragment comprises protein-encod- 
ing codons for amino acid sequences other than 20 
those contained in said polypeptide, eliminating the 
same while retaining at least a substantial portion of 
said coding sequence, the resulting fragment never- 
theless coding for an expression product other than 
said polypeptide; 25 

the product of step (a) or, where required, step (b) being 
a fragment encoding less than all of the amino acid 
sequence of said polypeptide; 

(c) providing by organic synthesis one or more syn- 
thetic non-reverse transcript-gene fragments en- 30 
coding the remainder of the amino acid sequence of 
said polypeptide, at least one of said fragments 
coding for the amino-terminal portion of the poly- 
peptide; and 

(d) deploying the synthetic gene fragments) of step 35 
(c) and that produced in step (a) or (b), as the case 
may be, in a replicable cloning vehicle in proper 
reading phase relative to one another and under the 
control of an expression promoter; 

whereby a replicable cloning vehicle capable of ex- 40 
pressing the amino acid sequence of said polypeptide is 
formed. 

2. The method of claim 1 wherein the cloning vehicle 
of step (d) is a bacterial plasmid. 

3. The method of claim 2 wherein the synthetic frag- 45 
ment encoding the amino-terminal portion of the poly- 
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peptide additionally codes for expression of a specifi- 
cally cleavable amino acid sequence, and wherein the 
fragments are deployed downstream from and in read- 
ing phase with expressed protein-encoding condons, 
whereby the conjugated plasmid expression product 
may be specifically cleaved to yield the polypeptide. 

4. The method of claim 2 wherein the amino acid 
sequence of the polypeptide is expressable unaccompa- 
nied by extraneous protein. 

5. The method of claim 4 wherein the fragment of 
step (a) comprises at least a majority of the coding se- 
quence for said polypeptide. 

6. The method of claim 2 wherein a synthetic frag- 
ment and an mRNA transcript fragment are ligated to 
one another before their deployment in the cloning 
vehicle, and wherein the opposite ends of the fragment 
and of the transcript are variously single stranded or 
blunt so as to ensure ligation of the two fragments in the 
proper order for expression of said polypeptide. 

7. The method of claim 5 wherein the polypeptide is 
human growth hormone, and wherein the first fragment 
comprises protein-encoding codons for amino acid se- 
quences other than those in human growth hormone, 
and wherein elimination step (b) yields the Hae III 
restriction enzyme fragment of the first fragment. 

8. The method of claim 7 wherein step (b) includes 
digestion of the Hae III fragment with a different re- 
striction enzyme, cleaving away codons for untran- 
slated messenger RNA and simultaneously providing a 
single-stranded terminus at one end of the resulting 
fragment. 

9. The method of claim 8 wherein the second restric- 
tion enzyme is Xma I. 

10. A method according to claim 1 wherein the poly- 
peptide is human growth hormone and wherein the 
codons for amino acids 1-24 thereof are essentially as 
depicted in FIG. 1. 

11. A method according to claim 4 wherein the poly- 
peptide is human growth hormone and wherein the 
codons for amino acids 1-24 thereof are essentially as 
depicted in FIG. 1. 

12. A method according to claim 7 wherein the co- 
dons for amino acids 1-24 are essentially as depicted in 
FIG. 1. 
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SPECIFICATION 

Method and means for microbial polypeptide expression 

This invention relates to a method and means for microbial polypeptide expression. 
Genetic information is encoded on double-stranded deoxyribonucleic acid ("DNA" or "genes") 
5 according to the order in which the DNA coding strand presents the characteristic bases of its repeating 5 
nucleotide components. "Expression" of the encoded information to form polypeptides involves a two- 
part process. According to the dictates of certain control regions ("regulons") in the gene, RNA 
polymerase may be caused to move along the coding strand, forming messenger RNA (ribonucleic acid) 
in a process called "transcription". In a subsequent "translation" step the cell's ribosomes in 

10 conjunction with transfer RNA convert the /nRNA "message" into polypeptides. Included in the 10 
information /r?RNA transcribes from DNA are signals for the start and termination of ribosomal 
translation, as well as the identity and sequence of the amino acids which make up the polypeptide. The 
DNA coding strand comprises long sequences of nucleotide triplets called "codons" because the 
characteristic bases of the nucleotides in each triplet or codon encode specific bits of information. For 

1 5 example, 3 nucleotides read as ATG (adenine-thymine-guanine) result in an mRNA signal interpreted as 1 5 
"start translation", while termination codons TAG, TAA and TGA are interpreted "stop translation". 
Between the start and stop codons lie the so-called structural gene, whose codons define the amino 
acid sequence ultimately translated. That definition proceeds according to the well-established "genetic 
code" (e.g., J. D. Watson, Molecular Biology of the Gene W. A. Benjamin Inc., N. Y., 3rd ed. 1 976) which 

20 describes the codons for the various amino acids. The genetic code is degenerate in the sense that 20 
different codons may yield the same amino acid, but precise in that for each amino acid there are one or 
more codons for it and no other. Thus, for example, all of the codons TTT, TTC, TTA and TTG, when read 
as such, encode for serine and no other amino acid. During transcription the proper reading phase or 
reading frame must be maintained. Consider for example what happens when the RNA polymerase 

25 transcriber reads different bases as the beginning of a codon (underlined) in the sequence ... 25 
GCTGGTTGTAAG . . .: 

• • ■ GCT GGT TGT AAG ....... Ala-Gly-Cys-Lys . . 

. . . G CTG GTT GTA AG ....... Leu-Val-Leu . . . 

. . . GC TGG TTG TAA A ....... Trp-Leu-(STOP) . 

The polypeptide ultimately produced, then, depends vitally upon the spatial relationship of the structural 
gene with respect to the regulon. 
30 A clearer understanding of the process of genetic expression will emerge once certain components 30 

of genes are defined: 

Operon — A gene comprising structural gene(s) for polypeptide expression and the control region 
("regulon") which regulates that expression. 

Promoter — A gene within the regulon to which RNA polymerase must bind for initiation of 
35 transcription. 35 

Operator — A gene to which repressor protein may bind, thus preventing RNA polymerase binding 
on the adjacent promoter. 

Inducer — A substance which deactivates repressor protein, freeing the operator and permitting 
RNA polymerase to bind to promoter and commence transcription. 
40 Catabolite Activator Protein ("CAP") Binding Site — A gene which binds cyclic adenoisine 40 
monophosphate ("c AMP") — mediated CAP, also commonly required for initiation of transcription. The 
CAP binding site may in particular cases be unnecessary. For example, a promoter mutation in the 
lactose operon of the phage A plac UV5 eliminates the requirement for cAMP and CAP expression. J. 
Beckwith et al, J. Mot. Biol 69, ISS— 1 60 ( 1 972). 
45 Promoter-Operator System — As used herein, an operable control region of an operon, with or 45 

without respect to its inclusion of a CAP binding site or capacity to code for repressor protein expression. 

Further by way of definition, and for use in the discussion of recombinant DNA which follows, we 
define the following: 

Cloning Vehicle — Non-chromosomal double stranded DNA comprising an intact "replicon" such 
50 that the vehicle is replicated, when placed within a unicellular organism ("microbe") by a process of 50 
"transformation". An organism so transformed is called a "transformant". 

Plasmid — For present purposes, a cloning vehicle derived from viruses or bacteria, the latter 
being "bacterial plasmids." 

Complementarity — A property conferred by the base sequences of single strand DNA which 
55 permits the formation of double stranded DNA through hydrogen bonding between complementary 55 
bases on the respective strands. Adenine (A) complements thymine (T), while guanine (G) complements 
" cytosine (C). 

Advances in biochemistry in recent years have led to the construction of "recombinant" cloning 
vehicles in which, for example, plasmids are made to contain exogenous DNA. In particular instances 
60 the recombinant may include "heterologous" DNA, by which is meant DNA that codes for polypeptides 60 
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ordinarily not produced by the organism susceptible to transformation by the recombinant vehicle. Thus, 
plasmids are cleaved to provide linear DNA having ligatable termini. These are bound to an exogenous 
gene having ligatable termini to provide a biologically functional moiety with an intact replicon and a 
desired phenotypical property. The recombinant moiety is inserted into a microorganism by 
5 transformation and transformants are isolated and cloned, with the object of obtaining large populations 5 
capable of expressing the new genetic information. Methods and means of forming recombinant cloning 
vehicles and transforming organisms with them have been widely reported in the literature. See, e.g., H. 
L. HeyneCker et al, Nature 263, 748 — 752 (1976); Cohen et al, Proc. Nat. Acad. Sci. USA 69, 2 1 1 0 
(1972); /Mi, 70, 1293 (1 973); ibid, 70,3240 (1973); ibid., 71, 1030 (1974); Morrow et al, Proc. Nat. " 

10 Acad. Sci. U.SA. 71 , 1743 (1974); Novick, Bacteriological Rev., 33, 210 (1969); Hershfield et al, Proc. 10 
Soc. Nat'/. Acad. Sci. U.S.A. 71, 3455 (1 974) and Jackson et al, ibid. 69, 2904 (1 972). A generalised 
discussion of the subject appears in S. Cohen, Scientific American 233, 24 (1 975). These and other 
publications alluded to herein are incorporated by reference. 

A variety of techniques are available for DNA recombination, according to which adjoining ends of 

1 5 separate DNA fragments are tailored in one way or another to facilitate ligation. The latter term refers to 15- 
the formation of phosphodiester bonds between adjoining nucleotides, most often through the agency 
of the enzyme T4 DNA ligase. Thus, blunt ends may be directly ligated. Alternatively, fragments 
containing complementary single strands at their adjoining ends are advantaged by hydrogen bonding 
which positions the respective ends for subsequent ligation. Such single strands, referred to as cohesive 

20 termini, may be formed by the addition of nucleotides to blunt ends using terminal transferase, and 20 
sometimes simply by chewing back one strand of a blunt end with an enzyme such A-exonuclease. 
Again, and most commonly, resort may be had to restriction endonucleases, which cleave 
phosphodiester bonds in and around unique sequences of nucleotides of about 4 — 6 base pairs in 
length. Many restriction endonucleases and their recognition sites are known, the so-called Eco Rl 

25 endonucleases being most widely employed. Restriction endonucleases which cleave double-stranded 25 
DNA at rotationally symmetric "palindromes" leave cohesive termini. Thus, a plasmid or other cloning 
vehicle may be cleaved, leaving termini each comprising half the restriction endonuclease recognition 
site. A cleavage product of exogenous DNA obtained with the same restriction endonuclease will have 
ends complementary to those of the plasmid termini. Alternatively, as disclosed infra, synthetic DNA 

30 comprising cohesive termini may be provided for insertion into the cleaved vehicle. To discourage 30 
rejoinder of the vehicles' cohesive termini pending insertion of exogenous DNA, the terminii can be 
digested with alkaline phosphatase, providing molecular selection for closures incorporating the 
exogenous fragment. Incorporation of a fragment having the proper orientation raltiye to other aspects 
of the vehicle may be enhanced when the fragment supplants vehicle DNA excised by two different 

35 restriction endonucleases, and itself comprises termini respectively constituting half the recognition 35 
seauence of the different endonucleases. 

Despite wide-ranging work in recent years in recombinant DNA research, few results susceptible 
to immediate and practical application have emerged. This has proven especially so in the case of failed 
attempts to express polypeptides and the like coded for by "synthetic DNA", whether constructed 

40 nucleotide by nucleotide in the conventional fashion or obtained by reverse transcription from isolated 40 
in RNA (complementary or "cDNA"). In this application we describe what appears to represent the first 
expression of a functional polypeptide product from a synthetic gene, together with related 
developments which promise wide-spread application. The product referred to is somatostatin 
(Guillemin U.S.P. 3,904,594), and inhibitor of the secretion of growth hormone, insulin and glucagon 

45 whose effects suggest its application in the treatment of acromelagy, acute pancreatitis and insulin- 45 
dependent diabetes. See R. Guillemin et al. Annual Rev. Med. 27 379 (1 976). The somatostatin model 
clearly demonstrates the applicability of the new developments described here on numerous and 
beneficial fronts, as will appear from the accompanying drawings and more clearly from the detailed 
description which follows. 

50 Summary of Invention 50 
According to the invention there is provided a recombinant plasmid suited for transformation of a 
bacterial host and use therein as a cloning vehicle, wherein the plasmid comprises: 

a) A regulon homologous to the bacterial host in its untransformed state; and 

b) In reading phase with the regulon, a DNA insert coding for the amino acid sequence of a 

55 heterologous polypeptide, such that bacteria transformed by the plasmid are capable of expressing said 55 
amino acid sequence in recoverable form. 

Brief Description of the Drawings 

The accompanying drawings illustrate one context in which preferred embodiments of the 
invention find application, i.e., expression of the hormone somatostatin by bacterial transformants . 
60 containing recombinant plasmids. - 60 

Figure 1 . Schematic outline of the process: the gene for somatostatin, made by chemical DNA 
synthesis, is fused to the E. coli /3-galactosidase gene on the plasmid pBR322. After transformation into 
E. coli, the recombinant plasmid directs the synthesis of a precursor protein which can be specifically 
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cleaved in vitro at methionine residues by cyanogen bromide to yield active mammalian polypeptide 
hormone. A, T, C and G denote the characteristic bases (respectively adenine, thymine, cytosine and 
guanine) of the deoxyribonucleotides in the coding strand of the somatostatin gene. 

Figure 2. Schematic structure of a synthetic gene whose coding strand (i.e., the "upper" strand) 
5 comprises codons for the amino acid sequence of somatostatin (given). 5 
- ' Figure 3. Schematic illustration of preferred method for construction of nucleotide trimers used in 
constructing synthetic genes. In the conventional notation employed to depict nucleotides in Figure 3, 
the 5' OH is to the left and the 3' OH to the right, e.g. 

B 

5' 
HO 

\ 

1 0 Figure 4, Flow chart for the construction of a recombinant plasmid (e.g., pSOM1 1 — 3) capable of 10 

expressing a somatostatin ("SOM")- containing protein, beginning with the parental plasmid pBR322. 
In Figure 4 the approximate molecular weight of each plasmid is stated in daltons ("d"). Ap r and Tc r 
respectively denote genes for ampicillin and tetracycline resistance, while Tc s denotes tetracycline 
susceptibility resulting from excision of a portion of the Tc 1 ' gene. The relative positions of various 

1 5 restriction endonuclease specific cleavage sites on the plasmids are depicted (e.g., Eco Rl, Bam I, etc.). 1 5 
Figures 5A and 5B. The nucleotide sequences of key portions of two plasmids are depicted, as is 
the direction of messenger RNA ("mRNA") transcription, which invariably proceeds from the 5' end of 
the coding strand. Restriction endonuclease substrate sites are as shown. Each depicted sequence 
contains both the control elements of the lac (lactose) operon, and codons for expression of the amino 

20 acid sequence of somatostatin (italics). The amino acid sequence numbers for /5-ga!actosidase ("/5-gal") 20 
are in brackets. 

Figures 6 — 8. As more particularly described in the "Experimental" discussion, infra, these depict 
the results of comparative radioimmune assay experiments which demonstrate the somatostatin 
activity of product expressed by the recombinant plasmids. 
25 Figure 9. Schematic structure of synthetic genes whose coding strands comprise codons for the 25 
amino acid sequences of the A and B strands of human insulin. 

Figure 10. Flow chart for construction of a recombinant plasmid capable of expressing the B chain 
of human insulin. 

Detailed Description 

30 1 . Preparation of Genes Coding for Heterologous Polypeptide 30 
DNA coding for any polypeptide of known amino acid sequence may be prepared by choosing 
codons according to the genetic code. For ease in purification, etc., oligodeoxyribonucleotide fragments 
of, for example, from about 1 1 to about 1 6 nucleotides are prepared separately, then assembled in the 
desired sequence. Thus, one prepares first and second series of oligodeoxyribonucleotide fragments of 

35 convenient size. The first series, when joined in proper sequence, yield a DNA coding strand for 35 
polypeptide expression (see. e.g., Figure 2, fragments A, B, C and D). The second series, when likewise 
joined in proper sequence, yield a strand complementary to the coding strand (e.g., Figure 2, fragments 
' E, F, G and H). The fragments of the respective strands preferably overlap such that complementarity 
promotes their self assembly through hydrogen bonding of the cohesive terminii of fragment blocks. 

40 Following assembly, the structural gene is completed by ligation in the conventional manner. 40 
The degeneracy of the genetic code permits substantial freedom in the choice of codons for any 
given amino acid sequence. For present purposes, however, codon choice was advantageously guided 
by three considerations. First, codons and fragments were selected, and fragment assembly was staged, 
so as to avoid undue complementarity of the fragments, one with another, save for fragments adjacent 

45 one another in the intended gene. Secondly, sequences rich in AT base pairs (e.g., about five or more) 45 
are avoided, particularly when produced by a sequence rich in GC base pairs, to avoid premature 
. termination of transcription. Thirdly, at least a majority of the codons chosen are those preferred in the 
expression of microbial genomes (see, e.g., W. Fiers, et al. Nature 260, 500 (1 976). For purposes of the 
appended claims, we define the following as codons "preferred for the expression of microbial 

50 genomes": 50 
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TABLE I 

PREFERRED ASSIGNMENT OF CODONS 



First Position 




Second Position 
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Most preferably in the case of somatostatin, the amino acid (codon) relationships of the structural 
gene are: gly (GGT); cys (TGT); lys (AAG); trp (TGG); ala (GCT, GCG); asn (AAT, AAC); phe (TTC, TTT); thr 
(ACT, ACG); and ser (TCC, TCG). 
5 Where the structural gene of a desired polypeptide is to be inserted in a cloning vehicle for 5 
expression as such, the gene is preceded by a "start" codon (e.g., ATG) and immediately followed by 
one or more termination or stop codons (see Fig. 2). However, as described infra, the amino acid 
sequence of a particular polypeptide may be expressed with additional protein preceding and/or 
following it. If the intended use of the polypeptide requires cleavage of the additional protein, 

1 0 appropriate cleavage sites are coded for adjacent the polypeptide-additional protein codon junction. * 0 
Thus, in Figure 1 as an example, the expression product is a precursor protein comprising both 
somatostatin and the greatest part of the /}-galactosidase polypeptide. Here ATG is not required to code 
for the start of translation because ribosomal construction of the additional /3-gal protein reads through 
into the somatostatin structural gene. Incorporation of the ATG signal, however, codes for the 

1 5 production of methionine, an amino acid specifically cleaved by cyanogen bromide, affording a facile 1 5 
method for converting precursor protein into the desired polypeptide. 

Figure 2 also exemplifies a further feature preferred in heterologous DNA intended for 
recombinant employment, i.e., the provision of cohesive terminii, preferably comprising one of the two 
strands of a restriction endonuclease recognition site. For reasons previously discussed, the terminii are 

20 preferably designed to create respectively different recognition sites upon recombination. 20 
While the developments described here have been demonstrated as successful with the 
somatostatin model, it will be appreciated that heterologous DNA coding for virtually any known amino 
acid sequence may be employed, mutatis mutandis. Thus, the techniques previously and hereafter 
discussed are applicable, mutatis mutandis, to the production of poly(amino)acids, such as polyleucine 

25: and polyalanine; enzymes; serum proteins; analgesic polypeptides, such as /5-endorphins, which 25 
modulate thresholds of pain, etc. Most preferably, the polypeptides produced as such will be 
mammalian hormones or intermediates therefor. Among such hormones may be mentioned, e.g., 
somatostatin, human insulin, human and bovine growth hormone, leutinizing hormone, ACTH, 
pancreatic polypeptide, etc. Intermediates include, for example, human preproinsulin, human proinsulin, 

30 the A and B chains of human insulin and so on. In addition to DNA made in vitro, the heterologous DNA 30 
may comprise cDNA resulting from reverse transcription from mRNA. See, Ullrich et al. Science 196, 
1313(1977). 

2. Recombinants Coding for the Expression of Precursor Protein 

In the process schematically depicted in Figure 1 , expression yields a precursor protein comprising ' 

35 both a polypeptide coded for by a specific heterologous structural gene (somatostatin) and additional 3B 
protein (comprising a portion of the /3-galactosidase enzyme). A selective cleavage site adjacent the 
somatostatin amino acid sequence permits subsequent separation of the desired polypeptide from 
superfluous protein. The case illustrated is representative of a large class of procedures made available 
by the techniques described herein. 

40 Most commonly, cleavage will be effected outside the replicative environment of the Plasmid or 40 
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other vehicle as, for example, following harvest of the microbial culture. In this fashion temporary 
conjugation of small polypeptides with superfluous protein may preserve the former against, e.g.,. in vivo 
degradation by endogenous enzymes. At the same time, the additional protein will ordinarily rob the 
desired polypeptide of bioactivity pending extra-cellular cleavage, with the effect of enhancing the 
5 biosafety of the procedure. In particular instances, of course, it may prove desirable to effect cleavage 5 
" within the cell. For example, cloning vehicles could be provided with DNA coding for enzymes which 
convert insulin precursors to the active form, operating in tandem with other DNA coded for expression 
of the precursor form. 

In preferred case, the particular polypeptide desired lacks internal cleavage sites corresponding to 

1 0 that employed to shed superfluous protein, although it will be appreciated that where that condition is 1 0 
not satisfied competition reactions will yet give the desired product, albeit in lower yield. Where the 
desired product is methionine-free, cyanogen bromide cleavage at methionine adjacent the desired 
sequence has proven highly effective. Likewise, arginine- and lysine-free products may be enzymatically 
cleaved with, e.g., trypsin or chymotrypsin at arg-arg, lys-lys or like cleavage sites adjacent the desired 

1 5 sequence. In the case where cleavage leaves, e.g., unwanted arginin attached to desired product, it may 1 5 
be removed by carboxypeptidase digestion. When trypsin is employed to cleave ar arg-arg, lysine sites 
within the desired polypeptide may first be protected, as with maleic or citraconic anhydrides. The 
cleavage techniques discussed here by way of example are but representative of the many variants 
which will occur to the art-skilled in light of the specification. 

20 Cleavable protein may be expressed adjacent either the C- or N -terminals of a specific polypeptide, 20 

or even within the polypeptide itself, as in the case of the included sequence which distinguishes 
proinsulin and insulin. Again, the vehicle employed may code for expression of protein comprising 
repeated sequences of the desired polypeptide, each separated by selective cleavage sites. Most 
preferably, however, codons for superfluous protein will be translated in advance of the structural gene 

25 of the desired product, as in the case illustrated in the Figures, in every case care should be taken to 25 
maintain the proper reading frame relative to the regulon. 

3. Expression Of Immunogens 

The ability to express both a specific polypeptide and superfluous protein provides useful tools for 
- the production of immunogenic substances. Polypeptide "haptens" (i.e. substances containing 
30 determina'nts specifically bound by antibodies and the like but ordinarily too small to elicit an immune 30 
response) can be expressed as conjugates with additional protein sufficient in size to confer 
, immunogenicity. Indeed, the /i-gal-somatostatin conjugate produced here by way of example is of 
immunogenic size and may be expected to raise antibodies which bind the somatostatin hapten. 
Proteins comprising in excess of 1 00 amino acids, most commonly in excess of 200 such, exhibit 
35 immunogenic character. 35 
Conjugates prepared in the foregoing fashion may be employed to raise antibodies useful in 
radioimmune or other assays for the hapten, and alternatively in the production of vaccines. We next 
describe an example of the latter application. Cyanogen bromide — or other cleavage products of viral 
protein will yield oligopeptides which bind to anibody raised to the protein itself. Given the amino acid 
40 sequence of such an oligopeptide hapten, heterologous DNA therefore may be expressed as a conjugate 40 
with additional protein which confers immunogenicity. Use of such conjugates as vaccines could be 
expected to diminish side reactions which accompany use of coat protein itself to confer immunity. 

4. The Control Elements 

Figure 1 depicts a process wherein a transformant organism expresses polypeptide product from 
45 heterologous DNA brought under the control of a regulon "homologous" to the organism in its 

untransformed state. Thus, lactose-dependent E. Coli. chromosomal DNA comprises a lactose or "lac" 
operon which mediates lactose digestion by, inter alia, elaborating the enzyme /5-galactosidase. In the 
particular instance illustrated, the lac control elements are obtained from a bacteriophage, .1 plac 5, 
which is infective for the E. Coli. The phage's lac operon, in turn, was derived by transduction from the 
50 same bacterial species, hence the "homology". Homologous regulons suitable for use in the disclosed 
process may alternatively derive from plasmidic DNA native to the organism. 

The simplicity and efficiency of the lac promoter-operator system commend its use in the systems 
we describe, as does its ability to be induced by IPTG (isopropylthio-/5~D galactoside). Of course, other 
, operons or portions thereof could be employed as well, e.g., lambda promoter-operator, arabinose 
55 operon (phi 80 c/ara), the colicine El, galactose, alkaline phosphatase or tryptohan operons. Promoter- 
operators derived from the latter (i.e., "tryp operon") would be expected to confer 1 00% repression 
pending induction (with indoleacrylic acid) and harvest. 

5. Plasmid Construction Generally 
The details of the process schematically illustrated in Figure 4 appear from the Experimental 

60 section, infra. At this point, however, it is useful to briefly discuss various of the techniques employed in 60 
constructing the recombinant plasmid of the preferred embodiment. 

The cloning and expression of the synthetic somatostatin gene employed two plasmids. Each 
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plasmid has an £ coRI substrate site at a different region of the /3-galactosidase structural gene (see 
Figures 4 and 5). The insertion of the synthetic somatostatin DNA fragment into the EcoRI sites of these 
plasmids brings the expression of the genetic information in that fragment under control of the lac 
operon controlling elements. Following the insertion of the somatostatin fragment into these plasmids, 
5 translation should result in a somatostatin polypeptide preceded either by 1 0 amino acid (pSOM1 ) of by 5 
virtually the whole /3-galactosidase subunit structure (pS0M1 1 — 3). 

The plasmid construction scheme initiates with plasmid rJBR322, a well-characterized cloning 
vehicle. Introduction of the lac elements to this plasmid was accomplished by insertion of a Hae\\\ 
restriction endonuclease fragment (203 nucleotides) carrying the lac promoter, CAP binding site, 

1 0 operator, ribosome binding site, and the first 7 amino acid codons of the /3-galactosidase structural 1 0 

gene. The Waelll fragment was derived from A plac5 DNA. The EcoRI-cleaved PBR322 plasmid, which 
had its termini repaired with T4 DNA polymerase and deoxyribonucleotide triphosphates, was blunt-end 
ligated to the Waelll fragment to create EcoRI termini at the insertion points. Joining of these Hae\\\ and 
repaired EcoRI termini generate the EcoRI restriction site (see Fig. 4 and 5) at each terminus. 

15 Transformants of E. Coli RR1 with this DNA were selected for resistance to tetracycline (Tc) and 15 
ampicillin (Ap) on 5-bromo-4-chloro-incolylgalactoside (X-gal) medium. On this indicator medium, 
colonies constitutive for the synthesis of -galactosidase, by virtue of the increased number of lac 
operators titrating repressor, are identified by their blue color. Two orientations of the Waelll fragment 
are possible but these were distinguished by the asymmetric location of an Hha restriction site in the 

20 fragment. Plasmid pBH 1 0 was further modified to eliminate the EcoRI endonuclease site distal to the 20 
lac operator (pBH20). 

The eight chemically synthesized oligodeoxyribonucleotides (Fig. 2) were labeled at the 5' termini 
with [ 32p ]- -ATP by polynucleotide kinase and joined with T4 DNA ligase. Through hydrogen bonding 
between the overlapping fragments, the somatostatin gene self-assembles and eventually polymerizes 
25 into larger molecules because of the cohesive restriction site termini. The ligated products were treated 25 
with £coRI and BamW\ restriction endonucleases to generate the somatostatin gene as depicted in 
Figure 2. 

The synthetic somatostatin gene fragment with FcoRI and Bam\ termini was ligated to the pBH20 
plasmid, previously treated with the EcoRI and flamHI restriction endonucleases and alkaline 

30' phosphatase. The treatment with alkaline phosphatase provides a molecular selection for plasmids 30 
carrying the inserted fragment. Ampicillin-resistant transformants obtained with this ligated DNA were 
screened for tetracycline sensitivity and several were examined for the insertion of an EcoRI-fla/nHI 
fragment of the appropriate size. 

Both strands of the EcoR\-BamH\ fragments of plasmids from two clones were analyzed by 

35 nucleotide sequence analysis starting from the BamVW and EcoRi sites. The sequence analysis was 35 
extended into the lac controlling elements; the lac fragment sequence was intact, and in one case, 
pSOM 1 , the nucleotide sequence of both strands were independently determined each giving the 
sequence depicted in Figure 5A. 

The EcoRl-Psf fragment of the pSOM 1 plasmid, with the lac-controlling element, was removed 

40 and replaced with the EcoRI-Psf fragment of pBR322 to produce the plasmid pSOM 1 1 . The EcoRI 40 
fragment of A plac 5, carrying the lac operon control region and most of the /5-galactosidase structural 
gene, was inserted into the EcoRI site of pSOM 1 1 . Two orientations of the EcoRi lac fragment of X plac 
5 were expected. One of these orientations would maintain the proper reading frame into the 
somatostatin gene, the other would not. Analysis of independently isofated clones for somatostatin 

45 activity then identified clones containing the proper oriented gene, of which the clone designated 45 
pSOM1 1 — 3 was one. 

6. The Microorganism 

Various unicellular microorganisms have been proposed as candidates for transformation, such as 
bacteria, fungii and algae. That is, those unicellular organisms which are capable of being grown in 

50 cultures or fermentation. Bacteria are for the most part the most convenient organisms to work with. 50 
Bacteria which are susceptible to transformation include members of the Enterobacteriaceae, such as 
strains of Escherichia coli and Salmonella; Bacillaceae, such as Bacillus subtillis; Pneumococcus; 
Streptococcus, and Haemophilus influenzae. 

The particular organism chosen for the somatostatin work next discussed was E. Coli. strain RR 1 , 

55 genotype: Pro~Leun"hn"hrR B _ M B rec A + Str r Lac y~£. Coli. RR1 is derived from E. Coli. HB 101 (H. WV 55 
Boyer, et a I, J. Mol. Biol. (1 969) 41 , 459 — 472) by mating with E. Coli K 1 2 strain KL1 6 as the Hfr 
donor. See J. H. Miller, Experiments in Molecular Genetics (Cold Spring Harbor, New York, 1 972). 
Cultures of both E. Coli RR1 and E. Coli RR1 (pBR322) have been deposited with the American Type 
Culture Collection without restriction as to access, respectively ATCC Nos. 3 1 343 and 3 1 344. The 

60 somatostatin-producing organism has likewise been deposited [ATCC No. 31447]. .60 
In the case of human insulin, A and B chain genes were cloned in E Coli K — 1 2 strain 294 (end A, 
thi~, hsr", hsm k + ), ATCC No. 31446, and that organism employed in expression of the A chain (E Coli 
K — 1 2 strain 294 [plAI], ATCC No. 3 1 448). The B chain of human insulin was first expressed in a 
derivative of HB101, i.e., E. Coli K — 12 strain D1 21 0 a lac + (i d o + zV + ), and that B gene-containing 
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organism has likewise been deposited (ATCC No. 31449). Alternatively, the B gene may be inserted in 
and expressed from the organism first mentioned. I.e., strain 294." 

EXPERIMENTAL 

I SOMATOSTATIN 

5 1 . Construction of Somatostatin Gene Fragments 5 
Eight oligodeoxyribonucleotides respectively labeled A through H in Figure 2 were first 
constructed, principally by the modified triester method of K. Itakura et al, J. Am. Chem. Soc. 97, 7327 
(1 975). However, in the case of fragments C, E and H resort was had to an improved technique in which 
fully protected trimers are first prepared as basic units for building longer oligodeoxyribonucleotides. 

1 0 The improved technique is schematically depicted in Figure 3, wherein B is thymine, N-benzoylated 1 0 
adenine, N-benzoylated cytosine or N-isobutyrulated guanine. In brief, and with reference to Figure 3, 
with an excess of I (2 mmole), the coupling reaction with II (1 mmole) went almost to completion in 60 
min with the aid of a powerful coupling reagent, 2,4,6-triisopropylbenzenesulfonyl tetrazolide (TPSTe, 4 
mmole; 2). After removal of the 5'- protecting group with 2% benzene sulfonic acid solution, the 5'- 

1 5 hydroxyl dimer V could be separated from an excess of 3'-phosphodiester monomer IV by simple 1 5 

solvent extraction with aqueous NaHC0 3 solution in CHCI 3 . The fully protected trimer block was 
prepared successively from the 5'-hydroxyl dimer V, I (2 mmole), and TPSTe (4 mmole) and isolated by 
chromatography on silica gel, as inS. T. Hunt eta/, Chem. andlnd. 1967, 1868 (1967). The yields of 
trimers made according to the improved technique appear from Table II. 

TABLE II 



Yields of Fully Protected Trimers 



Sequence 


Yield 


Sequence 


Yield 


TTT 


81% 


ATG 


69% 


TTT 


75% 


GCC 


61% 


GGA 


41% 


CCA 


72% 


AGA 


49% 


CAA 


72% 


ATC 


71% 


TTA 


71% 


OCT 


61% 


CAT 


52% 


ACA 


63% 


CCC 


73% 


ACC 


65% 


AAC 


59% 


CGT 


51% 


GAT 


60% 



The eight oligodeoxyribonucleotides, after removal of all protecting groups, were purified by high- 
pressure liquid chromatography on Permaphase AAX (R. A. Henry et al J. Chrom. Set. II, 358 (1973)). 
The purity of each oligomer was checked by homochromatography on thin-layer DEAE-cellulose and 
also by gel electrophoresis in 20% acrylamide slab after labeling of the oligomers with [A- 32f> ]-ATP in the 
25 presence of polynucleotide kinase. One major labeled product was obtained from each DNA fragment. 25 

2. Ligation and Acrylamide Gel Analysis of Somatostatin DNA 

The 5' OH termini of the chemically synthesized fragments A through H were separately 
phosphorylated with T4 polynucleotide kinase. [ 32p ]-p-ATP was used in phosphorylation so that reaction 
products could be monitored autoradiographically, although it will be appreciated that unlabelled ATP 

30 would serve as well were autoradiographically with. Just prior to the kinase reaction, 25 uCi of[y- 30 
32p ]ATP (approx. 1 500 Ci/mMol) (Maxam and Gilbert, Proc. Nat. Acad. Sci. U.S.A. 74, 1 507 (1 977) was 
evaporated to dryness in 0.5 ml Eppendorf tubes. Five micrograms of fragment were incubated with 2 
units of T4 DNA kinase (hydroxylapatite fraction, 2500 units/ml; 27), in 70 mM Tris-HCI pH 7.6, 1 0 mM 
MgCI 2 , 5 mM dithiothreitol in a total volume of 1 50. fi) for20 min at 37°C. To insure maximum 

35 phosphorylation of the fragments for ligation purposes, 1 0 ,ul of a mixture consisting of 70 mM Tris-HCI 35 
pH 7.6, 1 0 mM MgCI 2 , 5 mM dithiothreitol, 0.5 mM ATP and two units of DNA kinase were added and 
incubation continued for an additional 20 min at 7°C. The fragments (250 ng//il) were stored at — 20°C 
without further treatment. Kinased fragments A, B, E, and F (1 .25 fig each) were ligated in a total 
volume of 50 fi\ in 20 mM Tris-HCI pH 7.6, 1 0 mM Mg Cl 2 , 1 0 mM dithiothreitol, 0.5 mM ATP and 2 

40 units of T4 DNA ligase (hydroxylapatite fraction, 400 units/ml; 27), for 1 6 hr at 4°C. Fragments C, D, G 40 
and H were ligated under similar conditions. Samples of 2 fi\ were removed for analysis by 
electrophoresis on a 10% polyacrylamide gel followed by autoradiography (H. L. Heyneker et al, Nature 
263, 748 (1976)) in which unreacted DNA fragments are represented by fast migrating material and 
wherein the monomeric form of the ligated fragments migrate with bromophenol blue dye (BPB). Some 

45 dimerization also occurs by reason of the cohesive ends of the ligated fragments A, B, E and F, and of 45 
the ligated fragments C, D, G and H. These dimers represent the slowest migrating material, and may be 
cleaved by restriction endonuclease feoRI and SamHI, respectively. 

The two half molecules (ligated A + B + E + F and ligated C + D + G + H) were joined by an 
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additional ligation step carried out in a final volume of 1 50 p\ at 4°C for 1 6 hr. One microliter was 
removed for analysis. The reaction mixture was heated for 1 5 min at 65°C to inactivate the T4 DNA 
ligase. The heat treatment does not effect the migration pattern of the DNA mixture. Enough restriction 
endonuclease BamH\ was added to the reaction mixture to cleave the multimeric forms of the 
5 somatostatin DNA in 30 min at 37°C. After the addition of NaCI to 1 00 mM, the DNA was digested 5 
withFcoRI endonuclease. The restriction endonuclease digestions were terminated by phenol- 
chloroform extraction of the DNA. The somatostatin DNA fragment was purified from unreacted and 
partially ligated DNA fragments by preparative electrophoresis on a 1 0% polyacrylamide gel. The band 
containing the somatostatin DNA fragment was excised from the gel and the DNA was eluted by slicing 
10 the gel into small pieces and extracting the DNA with elution buffer (0.5 M ammonium acetate, 10 mM 10 
MgCI 2 , 0.1 mM EDTA, 0.1% SDS) overnight at 65° C. The DNA was precipitated with 2 volumes of 
ethanol, centrifuged, redissolved in 200 fi\ 1 0 mM Tris-HCI pH 7.6 and dialyzed against the same buffer 
resulting in a somatostatin DNA concentration of 4 /xg/ml. 

3. Construction of Recombinant Plasmids 
1 5 Figure 4 schematically depicts the manner in which recombinant plasmids comprising the 1 5 

somatostatin gene were constructed, and may be referred to in connection with the following more 
particularized discussion. 



A. The Parental Plasmid pBR322 

The plasmid chosen for experimental somatostatin cloning was pBR322, a small (molecular wt. 

20 approx. 2.6 megadaltons) plasmid carrying resistance genes to the antibiotics ampicillin (Ap) and 20 
tetracycline (Tc). As indicated in Figure 4, the ampicillin resistance gene includes a cleavage site for the 
restriction endonuclease Pst I, the tetracycline resistance gene includes a similar site for restriction 
endonuclease BamH\, and an EcoRI site is situated between the Ap r and Tc r genes. The plasmid pBR322 
is derived from pBR3 1 3, a 5.8 megadalton Ap r Tc r Col lmm plasmid (R. L Rodriquez et al, ICN— -UCLA 

25 Symposia on Molecular and Cellular Biology 5, 471 — 77 (1 976), R. L. Rodriquez et al. Construction and 25 
Characterization of Cloning Vehicles, in Molecular Mechanisms in the Control of Gene Expression, pp. 
471 — 77, Academic Press, Inc. (1 976). Plasmid pBR322 is characterized and the manner of its 
derivation fully described in F. Bolivar et al, "Construction and Characterization of New Cloning Vehicles 
II. A Multipurpose Cloning System", Gene (November 1 977). 

30 B. Construction of Plasmid pBH10 30 
Five micrograms of plasmid pBR322 DNA was digested with 1 0 units of the restriction 
endonuclease EcoRI in 100 mM Tris-HCI pH 7.6, 100 mM NaCI, 6 mM MgCI 2 at 37°C for 30 min. The 
reaction was terminated by phenol-chloroform extraction; the DNA was then precipitated with two and 
a half volumes of ethanol and resuspended in 50 fi\ of T4 DNA polymerase buffer (67 mM Tris-HCI pH 

35 8.8, 6.7 mM MgCI 2 , 1 6.6 mM (NH 4 ) 2 S0 4 , 1 67 fig/m\ bovine serum albumin, 50 juM of each of the 35 
dNTP's; A. Panet et al, Biochem. 12, 5045 (1 973). The reaction was started by the addition of 2 units of 
T4 DNA polymerase. After incubation for 30 min at 37° the reaction was terminated by a phenol- 
chloroform extraction of the DNA followed by precipitation with ethanol. Three micrograms ofA plac5 
DNA (Shapiro et al Nature 224. 768 (1 969)) was digested for 1 hr at 37°C with the restriction enzyme 

40 Hae\\\ (3 units) in 6 mM Tris-HCI pH 7.6, 6 mM MgCI 2 , 6 mM /3-mercaptoethanol in a final volume of 20 40 
fi\. The reaction was stopped by heating for 1 0 min at 65°C. The pBR322 treated DNA was mixed with 
the Hae\\\ digested X plac5 DNA and blunt-end ligated in a final volume of 30 fx\ with 1 .2 units of T4 
DNA ligase (hydroxylapatite fraction; A. Panet et al, supra) in 20 mM Tris-HCI pH 7.6, 1 0 mM MgCI 2 , 1 0 
mM dithiothreitol, 0.5 mM ATP for 1 2 hrs at 1 2°C. The ligated DNA mixture was dialyzed against 1 0 

45 mM Tris-HCI pH 7.6, and used for transformation of E. coli strain RR1 . Transformants were selected for 45 
. tetracycline and ampicillin resistance on minimal medium, plates containing 40 ^g/ml of 5-bromo-4- 
chloro-colylgalactoside (X-gal) medium (J. H. Miller, Experiments in Molecular Genetics (Cold Spring 
Harbor, New York, 1 972)). Colonies constitutive for the synthesis of /5-galactosidase were identified by 
their blue color. After screening 45 independently isolated blue colonies, three of them were found to 

50 contain plasmid DNA carrying two EcoRI sites separated by approximately 200 base pairs. The position 50 
of an asymmetrically located Hha\ fragment in the 203 b.p. Hae\\\ lac control fragment (W. Gilbert et al, 
in Protein-Ligand Interactions, H. Sand and G. Blauer, Eds. (De Gruyter, Berlin, (1 975) pp. 1 93 — 210) 
allows for the determination of the orientation of the Hae\\\ fragment, now an EcoRI fragment, in these 
plasmids. Plasmid pBH1 0 was shown to carry the fragment in the desired orientation, i.e., lac 

55 transcription going into the Tc r gene of the plasmid. 55 

C. Construction of Plasmid pBH20 

Plasmid pBH1 0 was next modified to eliminate the £coRI site distal to the lac operator. This was 
accomplished by preferential EcoRI endonuclease cleavage at the distal site involving partial protection 
by RNA polymerase of the other EcoRI site localized between theTc r and lac promoters, which are only 
60 about 40 base pairs apart. After binding RNA polymerase, the DNA (5 fig) was digested with EcoRI (1 60 
unit) in a final volume of 10 fi\ for 10 min at 37°C. The reaction was stopped by heating at 65°Cfor 10 
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min. The£coRI cohesive termini were digested with SI nuclease in 25 mM Na-acetate pH 4.5, 300 mM 
NaCI, 1 mM ZnCI 2 at 25°C for 5 min. The reaction mixture was stopped by the addition of EDTA (1 0 
mM final) and Tris-HCI pH 8 (5 mM final). The DNA was phenol-chloroform extracted, ethano! 
precipitated and resuspended in 1 00 fil of T4 DNA ligation buffer. T4 DNA ligase (1 fi\) was added and 
5 the mixture incubated at 12°C for 12 hr. The ligated DNA was transformed in£. coii strain RR1, and 5 
- ApTc r transformants were selected on X-gal-antibiotic medium. Restriction enzyme analysis of DNA 
screened from 1 0 isolated blue colonies revealed that these clones carried plasmid DNA with one £coRI 
site. Seven of these colonies had retained the EcoRl site located between the lac and Tc r promotors. The 
nucleotide sequence from the f coRI site into the lac-control region of one of these plasmids, pBH20, 
1 0 was confirmed. This plasmid was next used to clone the somatostatin gene. 1 0 



D. Construction of Plasmid pSOM 1 

Twenty micrograms of the plasmid pBH20 was digested to completion with restriction 
endonucleases EcoRl and BamHl in a final volume of 50 /ul. Bacterial alkaline phosphatase was added 
(0.1 unit of Worthington BAPF) and incubation was continued for 10 min at 65°C. The reactions were 

1 5 terminated by phenol-chloroform extraction and the DNA was precipitated with 2 volumes of ethanol, 1 5 
centrifuged and dissolved in 50 /xl 1 0 mM Tris-HCI pH 7.6, 1 mM EDTA. The alkaline phosphatase 
treatment effectively prevents self-ligation of the EcoRl, BamHl treated pBH20 DNA, but circular 
recombinant piasmids containing somatostatin DNA can stili be formed upon ligation. Since E. coii RR1 
is transformed with very low efficiency by linear plasmid DNA, the majority of the transformants will 

20 contain recombinant plasmids. Fifty microliters of somatostatin DNA (4 /xg/ml) were ligated with 25 ul 20 
of the BamHl EcoRl, alkaline phosphatase-treated pBH20 DNA in a total volume of 50 /u\ containing 20 
mM Tris-HCI pH 7.6, 10 mM MgCI 2 , 10 mM dithiothreitol, 0.5 mM ATP, and 4 units of T4 DNA ligase at 
22°C. After 10, 20 and 30 min, additional somatostatin DNA (40 ng) was added to the reaction mixture 
(the gradual addition of somatostatin DNA may favor ligation to the plasmid over self-ligation). Ligation 

25 was continued for 1 hr followed by dialysis of the mixture against 1 0 mM Tris-HCI pH 7.6. In a control 25 
experiment, BamHl, EcoRl, alkaline phosphatase-treated pBH20 DNA was ligated in the absence of 
somatostatin DNA under similar conditions. Both preparations were used without further treatment to 
transform £ col! RR1 . The transformation experiments were carried out in a P3 physical containment 
facility. (National Institutes of Health, U.S.A., Recombinant DNA Research Guidelines, 1 976). 

30 Transformants were selected on minimal medium plates containing 20 ^g/ml Ap and 40 /xg/ml X-gal. 30 
Ten transformants, which were all sensitive to Tc, were isolated. For reference these were designated 

pS0M1 , pS0M2, etc pSOM10. In the control experiment no transformants were obtained. Four out 

of the ten transformants contained plasmids with both an £coRI site and BamHl site. The size of the 
small £coRI, BamHl fragment of these recombinant plasmids was in all four instances similar to the size 

35 of the in vitro prepared somatostatin DNA. Base sequence analysis according to Maxam and Gilbert 35 
Proc. Nat Acad Sci. U.S.A. 74, 560 (1 977), revealed that the plasmid pS0M1 had the desired 
somatostatin DNA fragment inserted. 

The DNA sequence analysis of the clone carrying plasmid pS0M1 predicts that it should produce a 
peptide comprising somatostatin. However no somatostatin radioimmune activity has been detected in 

40 extracts of cell pellets or culture supernatants, nor is the presence of somatostatin detected when the 40 
growing culture is added directly to 70% formic acid and cyanogen bromide. E. coii. RR1 extracts have 
been observed to degrade exogenous somatostatin very rapidly. The absence of somatostatin activity in 
clones carrying plasmid pSOM 1 could well result from intracellular degradation by endogenous 
proteolytic enzymes. Plasmid pSOM 1 was accordingly employed to construct a plasmid coding for a 

45 precursor protein comprising somatostatin and sufficiently large as to be expected to resist proteolytic 45 
degradation. 

E. The Construction of Plasmids pSOM 1 1 and pSOM 1 1 — 3 

A plasmid was constructed In which the somatostatin gene could be located at the C-terminus of 
the /3-galactosidase gene, keeping the translation in phase. The presence of an £coRI site near the C- 

50 terminus of this gene and the available amino acid sequence of this protein (B. Polisky et al, Proc. Nat. 50 
Acad. Sci. U.S.A. 73, 3900 (1 976), A. V. Fowler et al, Id. at 74, 1 507 (1 976), A. I. Bukhari et al. Nature 
New Biology 243, 238 (1973) and K. E. Langley, J. Biol. Chetn. 250, 2587 (1 975)) permitted insertion 
of the fcoRI BamH\ somatostatin gene into the fcoRI site while maintaining the proper reading frame. 
_ For the construction of such a plasmid, pSOM 1 DNA (50 fxq) was digested with the restriction enzymes 

55 £eoRI and Pst\ in a final volume of 1 00 /u\. A preparative 5% polyacrylamide gel was used to separate 55 
the large Pst-EcoRl fragment that carries the somatostatin gene from the small fragment carrying the 
lac control elements. The large band was excised from the gel and the DNA eluted by slicing the gel into 
small pieces and extracting the DNA at 65°C overnight. In a similar way plasmid pBR322 DNA (50 /ug) 
was digested with Pst\ and £coRI restriction endonucleases and the two resulting DNA fragments 

60 purified by preparative electrophoresis on a 5% polyacrylamide gel. The small Psrl-£coRI fragment from 60 
pBR322 ( 1 /ng) was ligated with the large Psrl-EcoRI DNA fragment (5 fig) from pSOM 1 in a final 
volume of 50 ul with 1 unit of T4 DNA ligase at 12°Cfor 12 hrs. The ligated mixture was used to 
transform £. coii RR1 , and transformants were selected for ampicillin resistance on X-gal medium. As 
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expected, almost all the Ap r transformants (95%) gave white colonies (no lac operator) on X-gal 
indicator plates. The resulting plasmid, pSOM1 1, was used in the construction of plasmid pSOM1 1 — 3. 
A mixture of 5 ug of pSOM 1 1 DN A and 5 ug of A plac 5 DNA was digested with £coRI ( 1 0 units for 30 
min at 37'°C). The restriction endonuclease digestion was terminated by phenol-chloroform extraction. 
5 The DNA was then ethanol-precipitated and resuspended in T4 DNA ligase buffer (50 ^l). T4 DNA ligase 5 
(1 unit) was added to the mixture and incubated at 1 2°C for 1 2 hrs. The ligated mixture was dialyzed 
against 10 mM Tris-HCI pH 7.6 and used to transform £ Coli strain RR1. Transformants were selected 
for Ap r on X-gal plates containing ampicillin and screened for constitutive /5-galactosidase production. ■ 
Approximately 2% of the colonies were blue (pSOM1 1 — 1, 1 1 — 2 etc.). Restriction enzyme analysis of 

1 0 plasmid DNA obtained from these clones revealed that all the plasmids carried a new E coRl fragment of 10 
approximately 4.4 megadaltons, which carries the lac operon control sites and most of the /5- 
galactosidase gene. Because two orientations of the E coRl fragment are possible, the asymmetric 
location of a Hind\\\ restriction site was used to determine which of these coloniss were carrying this 
£coRI fragment with lac transcription proceeding into the somatostatin gene. Hind\\\-BamH\ double 

15 digestions indicated that only the clones carrying plasmids pS0M1 1 — 3, pS0M1 1 — 5, pSOM1 1 — 6 15 
and pSOM1 1 — 7 contained the fcoRI fragment in this orientation. 

4. Radioimmune Assay for Somatostatin Activity 

The standard radioimmune assays (R! A) for somatostatin (A. Arimura et al, Proc. Soc. Exp. Biol. 
Med. 148, 784 (1 975)) were modified by decreasing the assay volume and using phosphate buffer. 

20 Tyr 11 somatostatin was iodinated using a chloramine T procedure. (Id.) To assay for somatostatin, the 20 
sample, usually in 70% formic acid containing 5 mg/ml of cyanogen bromide was dried in a conical 
polypropylene tube (0.7 ml, Sarstedt) over moist K0H under vacuum. Twenty microliters of PBSA buffer 
(75 mM NaCI; 75 mM sodium phosphate, pH 7.2; 1 mg/ml bovine serum albumin; and 0.2 mg/ml 
sodium azide) was added, followed by 40 fi\ of a [ 125 l] somatostatin "cocktail" and 20 of a 1, 000-fold 

25 dilution in PBSA of rabbit antisomatostatin immune serum S39 (Vale et al. Metabolism 25, 1 491 25 
( 1 976). The [ ,25 l] somatostatin cocktail contained per ml of PBSA buffer: 250 ug normal rabbit gamma 
globulin (Antibodies, Inc.), 1 500 units protease inhibitor ("Trasylol", Calbiochem) and about 100,000 
counts of [ 125 1] Tyr 1 '-somatostatin. After at least 1 6 hour at room temperature, 0.333 ml of goat anti- 
rabbit gamma globulin (Antibodies, Inc., P=0.03) in PBSA buffer was added to the sample tubes. The 

30 mixture was incubated 2 hr at 37°C, cooled to 5°C, then centrifuged at 10,000 x gfor 5 min. The 30 . 

supernatant was removed and the pellet counted in a gamma counter. With the amount of antiserum 
used, 20% of the counts was precipitated with no unlabeled competing somatostatin. The background 
with infinite somatostatin (200 ng) was usually 3%. One-half maximum competition was obtained with 
10 pg of somatostatin. Initial experiments with extracts off. Coli strain RR1 (the recipient strain) 

35 indicated that less than 1 0 pg of somatostatin could easily be detected in the presence of 1 6 /ug or more 35 
of cyanogen bromide-treated bacterial protein. More than 2 (xq of protein from formic acid-treated 
bacterial extracts interfered somewhat by increasing the background, but cyanogen bromide cleavage 
greatly reduced this interference. Reconstruction experiments showed that somatostatin is stable in 
cyanogen bromide-treated extracts. 

40 A. Competition by Bacterial Extracts 40 
Strains £ Coli RR1 (pSOM11 — 5) and £. Coli RR1 (pSOM11— 4) were grown at37°Cto5x 10 s 
cells/ml in Luria broth. Then IPTG was added to 1 mM and growth continued for 2 hr. One-milliliter 
aliquots were centrifuged for a few seconds in an Eppendorf centrifuge and the pellets were suspended 
in 500 fi\ of 70% formic acid containing 5 mg/ml cyanogen bromide. After approximately 24 hr at room 

45 temperature, aliquots were diluted tenfold in water and the volumes indicated in Figure 6A were 45 
assayed in triplicate for somatostatin. In Figure 6A "B/B 0 " is the ratio of [ 125 l] somatostatin bound in 
the presence of sample to that bound in the absence of competing somatostatin. Each point is the 
average of triplicate tubes. The protein content of the undiluted samples was determined to be 2.2 
mg/ml for £. Co//RR1 (pSOM1 1—5) and 1.5 mg/ml for£. Coli RR1 (pSOM— 4). 

50 B. The Initial Screening of pSOM1 1 Clones for Somatostatin 50 
Cyanogen bromide-treated extracts of 1 1 clones (pSOM 1 1 — 2, pSOM 1 1 — 3, etc.) were made as 
described above for the case of Figure 6A. Thirty microliters of each extract was taken in triplicate for 
radioimmune assay, whose results appear from Figure 6B. The range of assay points is indicated. The 
values for picograms somatostatin were read from a standard curve obtained as part of the same 

55 experiment. 55 



The radioimmune assay results described thus far may be summarized as follows. In contrast to 
the results of experiments with pSOM1, four clones (pS0M11 — 3 11 — 5, 11 — 6, and 11 — 7) were 
found to have easily detectable somatostatin radioimmune activity Figures 6A and 6B. Restriction 
60 fragment analysis revealed that pSOM1 1 — 3, pSOM1 1 — 5, pSOM1 1 — 6 and pS0M1 1 — 7 had the 60 
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desired orientation of the lac operon, whereas pSOM1 1 — 2 and 1 1 — 4 had the opposite orientation. 
Thus there is a perfect correlation between the correct orientation of the lac operon and the production 
of somatostatin radioimmune activity. 

C. Effects of IPTG Induction and CNBr Cleavage on Positive and Negative Clones 
5 The design of the somatostatin plasmid predicts that the synthesis of somatostatin would be 5 
under the control of the lac operon. The lac repressor gene is not included in the plasmid and the 
recipient strain (£ coll RR1 ) contains the wild type chromosomal lac repressor gene which produces 
only 1 0 to 20 repressor molecules per cell (1 5). The plasmid copy number (and therefore the number of 
lac operators) is approximately 20 — 30 per cell, so complete repression is impossible. As shown in 

1 0 Table llf, infra the specific activity of somatostatin in E. «?//RR1 (pSOM1 1- — 3) was increased by IPTG, 1 0 
an inducer of the lac operon. As expected, the level of induction was low, varying from 2.4 to 7 fold. In 
experiment 7 (Table III) a activity, a measure of the first 92 amino acids of /5-galactosidase, also was 
induced by a factor of two. In several experiments no detectable somatostatin radioimmune activity can 
be detected prior to cyanogen bromide cleavage of the total cellular protein. Since the antiserum used in 

1 5 the radioimmune assay, S 39, requires a free N-termina! alanine, no activity was expected prior to 15 
cyanogen bromide cleavage. 

TABLE III 

Somatostatin Radioimmune Specific Activity 



Abbreviations: Luria Broth, LB; isopropylthiogalactoside, IPTG; cyanogen 
bromide, CNBr; somatostatin, SS. Protein was measured by the method of 
Bradford, Anal. Biochem. 72, 248 (1976). 



Experiment 
Number 


Strain 


Medium 


IPTG 
1 mM 


CNBr 
5 mg/ml 


pg SS/ug 
protein 


1 


11-2 


LB 


+ 


+ 


0.1 




11-3 


LB 


+ 


+ 


12 




11-4 


LB 


+ 


+ 


0.4 




11-5 


LB 


+ 


+ 


15 


2 


11-3 


LB 


+ 


+ 


12 




11-3 


LB 


+ 




0.1 


3 


11-3 


LB 


+ 


+ 


w1 




11-3 


LB 




+ 


8 




11-3 


LB 


+ 




0.1 


4 


11-3 


LB 


+ 


+ 


71 




11-3 


VB + glycerol* 


+ 


+ 


62 


5 


11-3 


LB + glycerol 


+ 


+ 


250 


6 


11-3 


LB 


+ 


+ 


320 




11-2 


LB 


+ 


+ 


0.1 


7 


11-3 


LB 


+ 


+ 


24 




11-3 


LB 




+ 


10 



* Vogel-Bonner minimal medium plus glycerol. 



D. Gel Filtration of Cyanogen Bromide-Treated Extracts 

Formic acid and cyanogen-treated extracts of the positive clones (pSOM 1 1 — 3, 1 1 — 5, 1 1 — 6, 

20 and 1 1 — 7) were pooled (Total volume 250 ,«l), dried, and resuspended in 0.1 ml of 50% acetic acid. 20 
[ 3 H] leucine was added and the sample was applied to an 0.7 x 47 cm column of Sephadex G — 50 in 
50% acetic acid. Fifty-microliter aliquots of the column fractions were assayed for somatostatin. 
Pooled negative clone extracts ( 1 1 — 2, 1 1 — 4, and 11 — 11) were treated identically. The results 
appear from Figure 5C. On the same column known somatostatin (Beckman Corp.) elutes as indicated 

25 (SS). In this system, somatostatin is well-separated from excluded large peptides and fully included 25 
small molecules. Only extracts of clones positive for somatostatin exhibited radioimmune activity in the 
column fractions and this activity elutes in the same position as chemically synthesized somatostatin. 

SUMMARY OF ACTIVITY INFORMATION 

The data establishing the synthesis of a a polypeptide containing the somatostatin amino acid 

30 sequence are summarized as follows: (1 ) Somatostatin radioimmune activity is present in E coli cells 30 
having the piasmid pS0M1 1 — 3, which contains a somatostatin gene of proven correct sequence and 
has the correct orientation of the lacfcoRI DNA fragment. Cells with the related plasmid pSOM1 1 — 2, 
which has the same somatostatin gene but an opposite orientation of the lac fcoRI fragment, produce 
no detectable somatostatin activity; (2) As predicted by the design scheme, no detectable somatostatin 

35 radioimmune activity is observed until after cyanogen bromide treatment of the cell extract; (3) The 35 
somatostatin activity is under control of the lac operon as evidenced by induction by IPTG, an inducer of 
the lac operon; (4) The somatostatin activity co-chromatographs with known somatostatin on Sephadex 
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G — 50; (5) The DNA sequence of the cloned somatostatin gene is correct. If translation is out of phase, 
a peptide will be made which is different from somatostatin at every position. Radioimmune activity is 
detected indicating that a peptide closely related to somatostatin is made, and translation must be in 
phase. Since translation occurs in phase, the genetic code dictates that a peptide with the exact 
5 sequence of somatostatin is made; (6) Finally, the above samples of £ coli RR1 (pS0M1 1 — 3) extract . 5 
inhibit the release of growth hormone from rat pituitary cells, whereas samples off. coli RR1 
(pSOM1 1 — 2) prepared in parallel and with identical protein concentration have no effect on growth 
hormone release. 

STABILITY, YIELD, AND PURIFICATION OF SOMATOSTATIN 

10 The strains carrying thefcoRI lac operon fragment (pSOM1 1 — 2, pSOM1 1 — 3, etc.) segregate 10 

with respect to the plasmid phenotype. For example, after about 1 5 generations, about one-half of the 
£ coli RR1 (pSOM1 1 — 3) culture was constitutive for/3-galactosidase, i.e., carried the lac operator, and 
of these about half were ampicillin resistant. Strains positive (pS0M1 1 — 3) and negative (pS0M1 1 — 2) 
for somatostatin are unstable, and therefore, the growth disadvantage presumably comes from the 

1 5 overproduction of the large but incomplete and inactive galactosidase. The yield of somatostatin has 1 5 
varied from 0.001 to 0.03% of the total cellular protein (Table 1 ) probably as the result of the selection 
for cells in culture having plasmids with a deleted lac region. The highest yields of somatostatin have 
been from preparations where growth was started from a single ampicillin resistant, constitutive colony. 
Even in these cases, 30% of the ceils at harvest had deletions of the lac region. Storage in the frozen 

20 state (lyophilization) and growth to harvest from a single such colony is accordingly indicated for the 20 
system described. Yields may be increased by, e.g:, resort to bacterial strains which overproduce lac 
repressor such that expression of precursor protein is essentially totally repressed prior to induction and 
harvest. Alternatively, as previously discussed, a tryptophan or other operator-promoter system which 
ordinarily is totally repressed may be employed. 

25 In the crude extract resulting from cell disruption in, e.g., an Eaton Press, the /3-galactosidase- 25 

somatostatin precursor protein is insoluble and is found in the first low speed centrifugation pellet. The 
activity can be solubilized in 70% formic acid, 6M guanididium hydrochloride, or 2% sodium dodecyl 
sulfate. Most preferably, however, the crude extract from the Eaton Press is extracted with 8M urea and 
the residue cleaved with cyanogen bromide. In initial experiments somatostatin activity derived from £ 

30 coli. strain RR1 (pSOM 1 1 — 3) has been enriched approximately 100-fold by alcohol extraction of the 30 
cleavage product and chromatography on Sephadex G — 50 in 50% acetic acid. When the product is 
again chromatographed on Sephadex G — 50 and then subjected to high pressure liquid 
chromatography, substantially pure somatostatin may be obtained. 

II. HUMAN INSULIN 

35 The techniques previously described were next applied to the production of human insulin. Thus, 35 

the genes for insulin B chain ( 1 04 base pairs) and for insulin A chain (77 base pairs) were designed from 
the amino acid sequence of the human polypeptides, each with single-stranded cohesive termini for the 
EcoRI and BamH\ restriction endonucleases and each designed for insertion separately into pBR322 
plasmids. The synthetic fragments, deca- to pentadeca-nucleotides, were synthesized by the block 

40 phosphotriester method using trinucleotides as building blocks and ultimately purified with high 40 
performance liquid chromatography (HPLC). The human insulin A and B chain synthetic genes were 
then cloned separately in plasmid pBR322. The cloned synthetic genes were fused to an £ Co///5- 
galactosidase gene as before to provide efficient transcription, translation, and a stable precursor 
protein. Insulin peptides were cleaved from ^3-galactosidase precursor, detected by radioimmunoassy, 

45 and purified. Insulin radioimmunoassay activity was then generated by mixing the £ Coli products. 45 

1 . Design and Synthesis of Human Insulin Genes 

The genes constructed for human insulin are depicted in Figure 9. The genes for human insulin, B 
chain and A chain, were designed from the amino acid sequences of the human polypeptides. The 5' 
ends of each gene have single stranded cohesive termini for the £coR1 andSamHI restriction ' 

50 endonucleases, for the correct insertion of each gene into plasmid pBR322. A Hind\\\ endonuclease 50 
recognition site was incorporated into the middle of the B chain gene for the amino acid sequence Glu- 
Ala to allow amplification and verification of each half of the gene separately before the construction of 
the whole B chain gene. The B chain and the A chain genes were designed to be built from 29 different 
oiigodeoxyribonucleotides, varying from decamer to pentadecamers.Each arrow indicates the fragment 

55 synthesized by the improved phosphotriester method, H1 to H8 and B1 to B12 for the B chain gene and 55 
A1 to A1 1 for the A chain gene. " . 

2. Chemical Synthesis of Oiigodeoxyribonucleotides 

Materials and methods for synthesis of oiigodeoxyribonucleotides were essentially those 
described in Itakura, K. et al (1975) J. Biol. Chem. 250, 4592 and Itakura, K. et al (\275)J.Amer. 
60 Chem. Soc. 97, 7327 except for these modifications: 60 
a) The fully protected mononucleotides, 5'-0-dimethoxytrityl-3'-/5-chlorophenyl-/3-cyanoethyl 
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phosphates, were synthesized from the nucleoside derivatives using the monofunctionai 
phosphorylating agent p-chlorophenyl-/5-cyanoethyl phosphorochioridate (1.5 molar equivalent) in 
acetonitrile in the presence of 1 -methyl imidazole Van Boom, J. H. eta! (1975) Tetrahedron 31 , 2953. 
The products were isolated in large scale (1 00 to 300g) by preparative liquid chromatography (Prep 
5 500 LC, Waters Associates). 5 
b) By using the solvent extraction method [Hirose, T. et al (1978) tetrahedron Letters, 2449] 32 
bifunctional trimers were synthesized (see Table IV) in 5 to 10 mmole scale, and 1 3 trimers, 3 
tetramers, and 4 dimers as the 3' terminus blocks, in 1 mmole scale. The homogeneity of the fully 
protected trimers was checked by thin layer chromatography on silica gel in two methanol/chloroform 

1 0 solvent systems: solvent a, 5% v/v and solvent b, 1 0% v/v (See Table IV). Starting from this library of 1 0 
compounds, 29 oligodeoxyribonucleotides of defined sequence were synthesized, 1 8 for the B chain 
and 1 1 for the A chain gene. 

The basic units used to construct polynucleotides were two types of trimer block, i.e. the 
bifunctional trimer blocks of Table IV and corresponding 3'-terminus trimers protected by an anisoyl 

1 5 group at 3'-hydroxy. The bifunctional trimer was hydrolyzed to the corresponding 3'-phosphodiester 1 5 
component with a mixture of pyridine-triethylamine-water (3:1:1 v/v) and also to the corresponding 5'- 
hydroxyl component with 2% benzenesulfonic acid. The 3'-terminus block previously referred to was 
treated with 2% benzenesulfonic acid to give the corresponding 5'-hydroxyl. The coupling reaction of an 
excess of the 3'-phosphodiester trimer (1.5 molar equivalent) with the 5'-hydroxyl component, however 

20 obtained, (1 molar equivalent) in the presence of 2,4,6-triisopropylbenzenesulfony! tetrazolide (TPSTe, 20 
3 to 4 equivalents) went almost to completion diester block reactant the reaction mixture was passed 
through a short silica gel column set up on a sintered glass filter. The column was washed, first with 
CHCI 3 to elute some side products and the coupling reagent, and then with CHCI 3 ;MeOH (95:5 v/v) in 
which almost all of the fully protected oligomer was eluted. Under these conditions, the charged 3'- 

25 phosphodiester block reactant remained in the column. Similarly, block couplings were repeated until 25 
the desired length was constructed. 

High performance liquid chromatography (HPLC) was used extensively during oligonucleotide 
synthesis for a) analysis of each trimer and tetramer block, b) analysis of the intermediate fragments 
(hexamers, monomers, and decamers), c) analysis of the last coupling reaction, and d) purification of the 

30 final products. The HPLC was performed by using a Spectra-Physics 3500B liquid chromatograph. After 30 
removal of-all protecting groups by cone. NH 4 OH at 50°C (6 h) and 80% AcOH at room temperature (15 
min), the compounds were analyzed on a Permaphase AAX (DuPont) [Van Boom, J. et al (1977) J. 
Chromatography 131,1 69.] column (1 m x 2 mm), using a linear gradient of solvent B (0.05M KH 2 P0 4 
— 1.0M KCI, pH 4.5) in solvent A (0.01 M KH 2 P0 4 , pH 4.5). The gradient was formed starting with buffer 

35 A and applying 3% of buffer B per minute. The elution was performed at 60°C, with a flow rate of 2 ml 35 
per minute. The purification of the 29 final oligonucleotides also was performed on Permaphase AAX, 
under the same conditions reported above. The desired peak was pooled, desalted by dialysis, and 
lyophilized. After labeling the 5' termini with (p _32 p)ATP using T 4 polynucleotide kinase, the 
homogeneity of each oligonucleotide was checked by electrophoresis on a 20% polyacrylamide gel. 
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TABLE IV 

SYNTHESIS OF TRIMER BUILDING BLOCKS 







Yield** 




Rf 


Purity*** 


In Figure 9, 


No. 


Compound* 


(%) 


a. 


b. 


(%) 


Present In: 


1. 


AAG 


47 


0.15 


0.40 


93 


B5, Be 


2. 


AAT 


49 


0.25 


0.52 


95 


it j A -t A r\ 

H1, A1, A6 


3, 




52 


0.28 


U.OO 


93 


i-lc Do Ao Ao 

no, oo-, /V, A8 


4. 


AG 1 


43 


0.27 


0.53 


91 


Qvi Dc Ac 

tJ4, do, A6 


5. 


AGG 


56 


0.33 


0.60 


96 




6. 


ACG 


39 


0.18 


0.45 


90 


no, b/ 


7. 




45 


r\ -in 
U.1U 


0.26 


oa 


Wft W7 RQ 

no i n / , Dy 


8. 


rtnT 
AU 1 


oo 

OO 


U.14 


U.4U 


ao 


HO AO A1 i 


9. 


ACsG 


50 


u.iy 


0.48 


o o 


LJq Dh Ac A-ih 
Mo, Dl, :AO| nllJ 


•i n 
IU. 


nun 


4o 




n en 
U.OU 




Aq 

My, 


11. 


1 1 G 


44 


0.26 


0.52 


95 


q j 1 It A o 

b4, B7, -A3 


12. 


i 1 G 


49 


u.n 


0.31 


94 


Llo Uc Ao Ao Ac 
HJ, no, A^, Ad, A5 


13. 


T-CT 


58 


0.24 


0.49 


96 


A A 

A4 


14. 


TCA 


45 


0.28 


0.53 


92 


H1, H2, H4, A1 


15. 


TCG 


39 


0.12 


0.34 


91 


A2 


16. 


TGG 


32 


0.10 


0.28 


87 


H.3, A1, A10 


17. 


TGC 


51 


0.18 


0.47 


93 


H6, B2, A4, A7, A8 


18. 


TGA 


46 


0.12 


0.37 


94 


i [ -j 

n/ 


19. 


TAC 


61 


0.22 


0.50 


90 


B4, A11 


20. 


TAA 


55 


0.17 


0.44 


95 


B5, A10 


21. 


CCT 


53 


0.30 


0.55 


97 


H3, H4, B10 


22. 


CAC 


47 


0.25 


0.51 


92 


A3 


23. 


CAA 


58 


0.25 


0.51 


93 


H2, H6, H8, A7 


24. 


CTT 


41 


0.28 


0.54 


92 


B6, B9, A4 


25. 


CGA 


40 


0.27 


0.52 


93 


A7 


26. 


CGT 


75 


0.25 


0.50 


89 


H2, H4, B3, Bt 


27. 


GGT 


35 


0.09 


0.26 


90 


B3 


28. 


GTT 


46 


0.18 


0.45 


93 


B2 


29. 


GTA 


38 


0.25 


0.50 


95 


B6, B8, A6 


30.. 


GAA 


39 


0.15 


0.39 


88 


H7, B3, B8, A5 


31. 


GAT 


52 


0.22 


0.49 


89 


B10, A9 


32. 


GCA 


42 


0.14 


0.39 


93 


A9 



* Fully protected trideoxynucleotides; 5-09- 
Dimethoxytrityl-3'-p-Chlorophenyl-yS-cyanoethyl phosphate 

** Yield was the overall yield calculated from the 5 '-hydroxy Imonomers. 

*** Based on HPLC analysis. 

3. Assembly and Cloning of B Chain Gene and the A Chain Gene 

The gene for the B chain of insulin was designed to have an EcoRl restriction site on the left end, a 
Hind\\\ site in the middle and BamH\ site at the right end. This was done so that both halves, the left 
5 FcoRl-W/7?Gflll half (BH) and the right Hind\\\-BamW\ half (BB), could be separately cloned in the 
convenient cloning vehicle pBR322 and after their sequences had been verified, joined to give the 
complete B gene (Figure 1 0). The BB half was assembled by ligation from 1 0 oligodeoxyribo- 
nucleotides, labeled B1 to B10 in Figure 9, made by phosphotriester chemical synthesis. B1 and B10 
were not phosphorylated, thereby eliminating unwanted polymerization of these fragments through 

1 0 their cohesive ends [HindWl and fla/nHI). After purification by preparative acrylamide gel electrophoresis 
and elution of the largest DNA band, the BB fragment was inserted into plasmid pBR322 which had 
been cleaved with HindWl and BamH\. About 50% of the ampicillin resistant colonies derived from the 
DNA were sensitive to tetracycline, indicating that a nonplasmid Hin d\\\ -Bam HI fragment had been 
inserted. The small Hind\\\-BamH\ fragments from four of these colonies (pBB1 01 to pBB 1 04) were 

1 5 sequenced and found to be correct as designed. 

The BH fragment was prepared in a similar manner and inserted into pBR322 which had been 
cleaved with £coRI and HindWl restriction endonucleases. Plasmids from three ampicillin resistant, 
tetracycline sensitive transformants (pBH1 to pBH3) were analyzed. The small £eoRI-#/rtc/!ll fragments 
were found to have the expected nucleotide sequence. 

20 The A chain gene was assembled in three parts. The left four, middle four, and right four 

oligonucleotides (see Figure 9) were ligated separately, then mixed and ligated (oligonucleotides A1 and 
A1 2 were unphosphorylated). The assembled A chain gene was phosphorylated, purified by gel 
electrophoresis, and cloned in pBR322 at the EcoR\-BamH\ sites. The FcoRI-f?an?HI fragments from two 
ampicillin resistant, tetracycline sensitive clones (pA1 0, pA1 1 ) contained the desired A gene sequence. 

25 4. Construction of Plasmids for Expression of A and B Insulin Genes 25 
Figure 1 0 illustrates the construction of the lac-insulin B plasmid (plB1 ). Plasmids pBH1 and 



10 
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pBB 1 0 1 were digested with £coRI and Hind\\\ endonucleases. The small BH fragment of pBH 1 and the 
large fragment of pBB1 01 (containing the BB fragment and most of pBR322) were purified by gel 
electrophoresis, mixed, and ligated in the presence of fcoRI-cleaved A plac5. The megadalton £coRI 
fragment of X plac5 contains the lac control region and the majority of the /5-galactosidase structural 
5 gene. The configuration of the restriction sites ensures correct joining of BH to BB. The lac E coRI 5 
fragment can insert in two orientations; thus, only half of the clones obtained after transformation 
should have the desired orientation. The orientation of ten ampicillin resistant, /5-galactosidase 
constitutive clones were checked by restriction analysis. Five of these colonies contained the entire B 
gene sequence and the correct reading frame from the /5-galactosidase gene into the B chain gene. One, 
10 PIB1, was chosen for subsequent experiments. 10 
In a similar experiment, the 4.4 megadalton lac fragment from A placS was introduced into the 
pA11 plasmid at the EcoR\ site to give plA1.plA1 is identical to plB1 except that the A gene fragment is 
substituted for the B gene fragment. DNA sequence analysis demonstrated that the correct A and B 
chain gene sequences were retained in plA1 and plB1 respectively. 

1 5 5. Expression 1 5 

The strains which contain the insulin genes correctly attached to /5-galactosidase both produced 
large quantities of a protein the size of /5-galactosidase. Approximately 20% of the total cellular protein 
was this /5-galactosidase-insulin A or B chain hybrid. The hybrid proteins are insoluble and were found in 
the first low speed pellet where they constitute about 50% of the protein. 

20 To detect the expression of the insulin A and B chains, we used a radioimmunoassay (RIA) based 20 
on the reconstitution of complete insulin from the separate chains. The insulin reconstitution procedure 
of Katsoyannis et al (1 967) Biochemistry 6, 2642 — 2655, adapted to a 27-microliter assay volume, 
provides a very suitable assay. Easily detectable insulin activity is obtained after mixing and . 
reconstituting S-Sulfonated derivatives of the insulin chains. The separated S-sulphonated chains of 

2'5 insulin do not react significantly, after reduction and oxidation, with the anti-insulin antibody used. 25 
To use the reconstitution assay we partially purified the /5-galactosidase-A or B chain hybrid 
protein, cleaved with cyanogen bromide, and formed S-sulfonated derivatives. 

The evidence that we have obtained correct expression from chemically synthesized genes for 
human insulin can be summarized as follows: a) Radioimmune activity has been detected for both 

30 chains, b) The DNA sequences obtained after cloning and plasmid construction have been directly 30 
verified to be correct as designed. Since radioimmune activity is obtained, translation must be in phase. 
Therefore, the genetic code dictates that peptides with the sequences of human insulin are being 
produced, c) The £ coli products, after cyanogen bromide cleavage, behave as insulin chains in three 
different chromatographic systems which separate on different principles (gel filtration, ion exchange, 

35 and reversed phase HPLC). d) The E coli produced A chain has been purified on a small scale by HPLC 35 
and has the correct amino acid composition. 



CLAIMS: 

1 . A recombinant plasmid suited for transfomation of a bacterial host and use therein as a cloning 
vehicle, wherein the plasmid comprises: 

40 a) A regulon homologous to the bacterial host in its untransformed state; and 40 

b) In reading phase with the regulon, a DNA insert coding for the amino acid sequence of a 
heterologous polypeptide, such that bacteria transformed by the plasmid are capable of expressing said 
amino acid sequence in recoverable form. 

2. A bacterial plasmid according to claim 1 . 

45 3. A plasmid according to claim 2 whose regulon is essentially identical to a regulon ordinarily 45 

present in the bacterial host's chromosomal DNA. 

4. A plasmid according to claim 2 wherein the regulon comprises the promoter-operator system of 
an operon selected from the group consisting of E. coli. lac operon and E. coli. tryptophan operon. 

5. A plasmid according to either claim 3 or claim 4 wherein the heterologous polypeptide is a 

50 mammalian hormone or an intermediate therefor. 50 

6. A plasmid according to claim 5 wherein the heterologous polypeptide is selected from the 
group consisting of human preproinsulin, human proinsulin, the A chain of human insulin, the B chain of 
human insulin, human or bovine growth hormone, leutinizing hormone, ACTH and pancreatic 
polypeptide. 

55 7. The plasmid of claim 6 wherein the heterologous polypeptide is a mammalian hormone. 55 

8. The plasmid of claim 5 wherein the heterologous polypeptide is somatostatin. 

9. The plasmid of claim 8 wherein the regulon comprises the promoter-operator system of the £. 
coli. lac operon. 

1 0. A plasmid according to claim 1 , substantially as described in Experimental I or Experimental II 

60 herein. 60 
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@ Method of making a selected protein. 



@ A plasmid or phage gene for a perioiasmic or extra- 
cellular bacterial protein is cleaved, a doublestranded DNA 
sequence coding for a selected protein or portion thereof 
from a eukaryotic cell such as insulin is inserted In that 
Cleaved gene by recombinant DNA techniques and used to 
transform a bacterium, and the excreted selected protein is 
collected. 
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METHOD' OF MAKING A SELECTED PROTEIN 

The invention described herein was made in the 
course of work under a grant or award from the United 
States Department of Health, Education and Welfare. 
TECHNICAL FIELD OF INVENTION 

This invention relates to a process for produ- 
cing specific proteins in bacteria and having them excreted 
from the bacterial cell and pertains more specifically to 
inserting the DNA representing the desired non-bacterial 
protein or part of a protein by recombinant techniques into 
the plasmid or phage gene for either a periplasmic or an 
extracellular protein, hereinafter called a "carrier pro- 
tein", transforming a bacterial host with the recombined 
gene, and culturing the transformed host to excrete the 
protein. The protein thus produced can be collected by 
conventional procedures from the culture medium or from 
the periplasmic space depending upon the choice of carrier 
protein gene. 

BACKGROUND ART 
It is known to insert DNA representing a 
specific protein into the gene for an intracellular protein. 
Scheller et al . , Science, 196, 177-180 (1977). The 
problem however is that proteins made in this way are 
mixed with other intracellular proteins and are therefore 
subject to degradation by enzymes within the cell so that 
there is a problem in obtaining the desired protein product 
in purified form. 

DISCLOSURE OF INVENTION 
The foregoing problem is avoided in accordance 
with our invention by providing a method of making a selected 
protein or portion thereof by inserting DNA representing the 
selected protein or portion thereof into a bacterial gene, 
characterized by cleaving the bacterial gene for an extra- 
cellular or periplasmic carrier protein, inserting into 
the cleavage site by a recombinant step a non-bacterial 
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DNA fragment which codes for the selected protein or 
portion thereof .transforming a bacterial host with the 
recombined gene, and culturing the transformed bacteria 
to excrete the selected protein or portion thereof. 

By way of example, by employing a gene for 
a carrier protein which has a leader sequence of hydro- 
phobic amino acids at its amino terminus and which is 
normally excreted through the membrane of the cell 
within which it is made, with cleavage of the hydrophobic 
leader sequence during excretion, a selected protein 
can be produced which can be recovered either from the 
periplasmic space or. from the medium in which the 
bacterium is grown, depending upon the choice of carrier 
protein. In this way contamination from the other 
proteins within the bacterium is avoided while achieving 
greater stability by avoiding the enzymes within the 
bacterial cell which degrade foreign proteins. 

Among the bacterial genes for carrier proteins 
which can be employed in the present invention are the 
genes for antibiotic resistance, such as the gene for 
penicillin resistance or penicillinase, the gene for 
chloramphenicol resistance, or the gene for tetracycline 
resistance, as well as the gene for alkaline phosphatase 
and the gene for bacterial ribonuclease. 

Genes or DNA fragments which code for a variety 
of proteins or portions thereof can be inserted in the bacterial 
carrier protein gene by the process of the present invention. These 
proteins include a variety of non-bacterial proteins such as 
eukaryotic cell proteins and viral proteins. Of parti- 
cular importance are eukaryotic cell proteins such as 
insulin, human growth hormone, interferon and other 
pharmacologically active proteins. These are synthesized 
by their respective genes as pre-proteins or precursor 
proteins having at their amino terminus a series of 
hydrophobic amino acids. This hydrophobic leader 
sequence is not identical to that for the bacterial 
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proteins which are excreted through the bacterial membrane. 
Therefore.the fact that pre- insulin or other pre-proteins 
of higher cells contain a .hydrophobic leader sequence is in 
itself no basis for expecting that such a pre-protein 
5 could be matured in the bacterial cell even if it could 
be synthesized within the cell. Moreover, the process 
of the present invention, in addition to providing for 
the synthesis within and excretion from bacterial cells 
of matured proteins of eukaryotic cells, which are of 

10 known utility, also makes possible the synthesis in 

and excretion from bacterial cells of other extracellular 
products of commercial interest. These include other 
fused proteins and fused proteins consisting of carrier 
proteins, as defined above, which carry specific deter- 

15 minants, for example, viral antigens such as coat proteins 
or other antigenic proteins of viruses. These latter 
fused proteins are useful in the manufacture of vaccines, 
being capable because of their antigenic character of 
inducing generation of an immune response specific to 

20 the viruses. Such vaccines will be unusually safe 

because they will not contain any live or inactivated 
virus material. Furthermore, it is possible by this 
process to construct vaccines for viruses which cannot 
be grown in culture* 

25 BRIEF DESCRIPTION OF THE D RAWINGS 

Figs . i ) 2 and 3 of the drawing show the complete 
base sequence for the E. coli penicillinase gene carried 
on the plasmid pBR322 along with the corresponding amino 
acid sequence of the protein for which it codes. 

30 BEST MODE OF CARRYING OUT THE INVENTION 

The following specific example is intended 
to Illustrate more fully the nature of the present 
invention without acting as a limitation upon its scope. 
Example 

35 There was employed as the carrier protein E . coll [ Escherichia 

coli ] penicillinase, the gene for which is carried on the 
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15 



25 



30 



small plasmid pB R322. A restriction enzyme map of this 
gene is shown in the drawing. This plasmid vector has 
been described by Bolivar_et al. , Gene, 2. 95-113 
(1977). As the host bacteria we employed V coli 1776*- 
5 see Curtisse^ja. in Recombinant Molecules : Impact on 
Science and Society, Proceedings of the Tenth Miles 
International Symposium, eds. Beers& Bassett, 45-56 
(1977). The host-vector combination is a certified EK2 

system, certified by the Unitpd q<- a t- Q o t.t.,*.- n x 

u y united btates National Institute 

10 of Health (NIH) , July 7, 1977. 

The plasmid carries a Pst [providencia stuartii 
endonuelease] restriction site of the penicillinase gene cor- 
responding to die position of amino acids 181 and 182, as shown in 
the drawing. Double stranded cDNA was synthesized fron RNA contain- 
ing preproinsulin in RNA (PPI-mRNA) isolated from an X-ray 
induced, transplantable rat B-cell tumor (Chick et al 
Proc. Natl. Acad. Sci. USA [P.N.A.S. ], 74, 628-632~U977l) 
Batches of 20 g each of frozen tumor slices were ground 
with sterile sand with mortar and pestle and the 
cytoplasmic RNA purified from a post-nuclear supernatant 
n,yL Hf Pitati ° n (Palmiter ' Biochemistry, 13,3603-3615 
ThT II T hY 6XtraCti0n With Phenol and chloroform. 
This RNA was further purified by aligo-dT-cellulose 
chromatography (Aviv et al. P.N.A.S.. 69, 1408-1412 (1972)) 
and used directly as template for double- stranded cDNA 

llT^t^ d6SCribed CEfatratiadis- et al . , Cell. 7, 279- 
288 (1976)), except that a specific (dpT) 8 dpGpC primer 
(Collaborative Research) was utilized for reverse trans- 
cription. The, concentrations of RNA and primer were 
7 mg/ul and 1 mg/]ll> respectively. All four a- 32 P-dNTPs ■ 
were at 1.25 mM (final specific activity 0.85 Ci/m mole) 
The reverse transcript was 2% of the input RNA, and 257. ' 
of it was finally recovered in the double-stranded DNA 
product. 

^^eTS^P^^iiJ. 17 ^ ^ placed with and made 

Sockville Maryland Tl % a J? 1 * 1 ?* 1 ^ 0iLcare Election, 
^viiie, Maryland, U.S,A. and has been assigned ATCC No. 31244. 



20 
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15 



The double-stranded cDNA was inserted into 
the Pst site of plasmid P BR322 by the following procedure: 
PBR322 DNA (5.0 yg) was linearized with Pst and approxi- 
mately 15 dG residues were added per 3' end by terminal 
transferase at 15°C in the presence of 1 tnM Co" (I^choudhury 
et al . Nucleic Acids Res.. 3,101-116(1976)) and 100 pg/ml 
^toclaved gelatin. The same procedure was used to add 
dC residues to- 2.0 yg of double-stranded cDNA. The 
reaction mixtures were extracted with phenol, and 
ethanol precipitated. The dC tailed double- stranded 
CDNA was electrophoresed in a 6% polyacrylamide gel 
under native conditions. Following autoradiography, 
molecules in the size range of 300 to 600 base pairs 
( 5 ur) were eluded from the gel (Ef stratiadis et 
al in Methods in Molecular Biology. 8.1-124 (1976)). 
The eluted double- stranded cDNA was concentrated by 
ethanol precipitation, redissolved in 10 nuM Tris P H 8. 
mixed with 5 yg dG tailed P BR322 and dialized versus 
0 1 M HaCl. 10 m EDTA, 10 mM Tris pH 8. The mixture 
►0 (4 tnl) was then heated at 56° for 2 minutes and anneal- 
_ , af . ir for 2 hours. The hybrid DNA was 
ing was performed at 42 tor i nou . 

P««n- v eoli 1776. The use of oligo 
used to transform E. coil lnQprt - 
dC-dG joins regenerates the Pst cuts so that the insert 

may be later «**.ed. ■ „ 76 (an EK _ 2 ho8t 

,c Transformation of E. coti x 

with PBR322, an EK-2 vector) waa perforce ,. • • 
sa£ety cabinet In a P3 physical containment in 
c „pliance with H.I.H. guideline., f.r r..«M« « D,U 

r( , qparch published in the ¥eaeraL_*± u 

researcn P""^ „ d bv a transection pro- 

1776 was transformed oy 

cedure (Enea et al. . J.Mol.Biol.. : J>. 

ceu v 1776 was i?r<*m In U 

slightly modified as follows: x i • 

Broth [10 gms tryptone.5 gm yeas ^ ^^ Hc 

(Difco)] supplemented with 10 vg/* « 



<i 0 V A 

20o"ml portion of cells were sediment^ 



35 acid and 40 . g /»l thymidine (Si**.)" ^ ^ 
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resuspended by swirling in 1/10 volume of cold buffer 
containing 70 mM MnCl 2 , 40 mM NaAc pH 5.6, 30 mM CaCl 2 
and kept on ice for 20 minutes. The cells were repelleted 
and resuspended in 1/30 of the original volume in the 
same buffer. The annealed DNA (2 ml) was added to the 
cells. Aliquots of this mixture (0.3 ml) were placed 
in sterile tubes and incubated on ice 60 minutes. The 
cells were then placed at 37° for 2 minutes. Broth was 
added to each tube (.7 ml) and the tubes incubated at 
37° for 15 minutes. A 200 jjI portion of the cells was 
spread on sterile nitrocellulose filters (Millipore) 
overlaying agar plates containing 15 jig /ml tetracycline. 
(The filters were boiled to remove detergents before 
use.) The plates were incubated at 37° for 48 hours. 
Replicas of the filters were made. The nitrocellulose 
filters containing the transformants were removed from 
the agar and placed on a layer of sterile Whatman 
filter paper. A new sterile filter was placed on top 
of the filter containing the colonies and pressure was 
applied with a sterile velvet cloth and a duplicate 
block. A sterile needle was used to key the filters. 
The second filter was placed on a new agar plate and 
incubated .at 37° for 48 hr. The colonies on the first 
filter were screened by the Grunstein-Hogness (P.N.A.S., 
72, 3961-3965 (1975)) technique, using as probe an 80- 
nucleotide long fragment produced by Hae III digestion 
of high specific activity cDNA copied from the rat 
oligo-dT bound RNA. Positive colonies were rescreened 
by the HART method (Paterson et al. , P.N.A.S., 74, 
4370-4374 (1977)) as follows! Plasmid DNA (about 3 pg) 
.was digested with Pst, ethanol precipitated, and dissolved 
directly into 20 jjL dionized formamide. After heating 
for one minute at 95° each sample was placed on ice. 
After the addition of 1.5 ug oligo (dT) -cellulose bound 
RNA, PIPES at pH 6.4 to 10 mM and NaCl to 0.4 M, the 
mixtures were incubated for 2 hr at 50°. They were 
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then diluted by the addition of 75 \il and ethanol 
precipitated in the presence of 10 yg wheat-germ tKNA, 
washed with 70% ethanol, dissolved in H 2 0 and added to 
a wheat-germ cell-free translation mixture (Roberts et 
5 al, .P.N.A.S. , 70, 2330-2334 (1973)). After three hours 
at 23°C, duplicate 2 pi aliquots were removed for 
trichloroacetic acid precipitation; the remainder of 
the reaction mixture was treated with ribonuclease , 
diluted with immunoassay buffer, and analyzed for the 

10 syntheses of immunoreactive preproinsulin by means of 
a double antibody immunoprecipitation (Lomedico et 
al. Nucleic Acids Res., 3, 381-391 (1976)) .The washed 
immunoprecipitates were dissolved in 1 ml of NCS 
(Amersham) and counted in 10 yl of Omnifluor (New 

15 England Nuclear) by liquid scintillation. 

One colony was identified by the HART 
Screening. The Pst excisable insert was sequenced by 
the method of Maxam and Gilbert (P.N.A.S., 74 t 560-564 
(1977)) to show that It corresponded to the sequence of rat 

20 preproinsulin I. This insert, labeled by nick transla- 
tion with DNA polymerase I was used to screen 200 
transformants with the Grunstein-Hogness assay. There 
were identified 48 clones containing rat preproinsulin 
cDNA inserts. 

25 These 48 clones of transformed E. coli x 1776 

were screened using an in situ radioimmunoassay technique 
to determine whether the clones were producing insulin 
antigens and whether they were producing fused polypeptide 
chains, one end of which being insulin antigen and the 

30 other end penicillinase (the bacterial carrier protein) 
antigen. Presence of the fused polypeptide chains would 
indicate that the clones contained genes which were the 
products of the fusion of the bacterial gene for penicil- 
linase with the eukaryotic cell gene for insulin. Such 

35 fused polypeptide chains were in fact found, using the 
technique to be. described below. The technique takes 
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advantage of the fact that the fused proteins being 
searched for contain two antigenic ends, each of which 
will bind to its respective specific antibody. A specific 
antibody was laid down on a plastic disk, the antigenic 
5 protein from lysed bacterial cells placed in contact 
with this disk, then the disk was rinsed and exposed 
to radioactive antibodies. A protein molecule will 
bind to the antibody fixed to the plastic with one 
antigenic determinants and will bind in turn a radio- 

10 active antibody with a second determinant. If anti- 
penicillinase is on the disk and anti- insulin is 
labeled, after the "sandwich" is washed, the only points 
of radioactivity remaining will mark the presence of 
fused proteins. In more detail, the method was as 

15 follows: 

Each 8.25 cm diameter disk of clear polyvinyl 
(PV) 8 mm thick (Dora May Co., New York) was flattened 
between sheets of smooth paper. In a glass petri disk, 
each disk was then placed upon the surface of a liquid 

20 containing 10 ml of 0.2 MNaHCO^ , at pH 9.2, containing 
60 yg/ml IgG. After 2 minutes or longer at room 
temperature, the disk was removed and washed twice with 
10 ml of cold wash buffer (WB) , which consisted of 
phosphate-buffered saline, 0.5% normal guinea pig serum, 

25 0.1% bovine serum albumin and 0.3 mg/ml streptomycin 

sulfate. Each disk was used immediately after washing. 

Antigens were released from bacterial cells 
by transferring colonies onto 1.5% agarose containing 0.5 
mg lysozyme/ml, 30 mM Tris pH 8, and 10 mM EDTA. The 

30 IgG-coated surface of a PV disk was placed face down on 
the agarose and bacterial colonies and left for 60 
minutes at 4° . Each disk was then removed and washed 3 
times with 10 ml of cold WB. This step completed the 
immunoadsorption of antigen onto the solid-phase antibody 

35 layer. 

Reaction of the tAJ I-labeled antibodies with 
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the antigen now adhering to the disks was done by setting 
1.5 ml WB containing 5 x 10^ cpm (y emission) -^^I-IgG onto 
the center of an 8.25 cm diameter flat disk of ordinary 
nylon mesh which had been placed in the bottom of a petri 
5 dish. The mesh served as a spacer. A disk treated as 
in the earlier steps then was placed facedown on the 
mesh and solution and incubated overnight at 4° . Each 
disk was then washed twice with 10 ml cold WB and twice 
with water, and allowed to dry at room temperature. At 

10 this point, fused proteins had bound to both the 

ordinary and radioactively labeled layers of IgG. These 
proteins were then detected with conventional autoradio- 
graphy technique using Kodak No. Screen Film or Kodak 
X-OMAT R film and a DuPont Cronex Lighting plus intensify- 

15 ing screen as described for example by Laskey ^t jal., FEBS 
Lett., 82, 314-316 (1977). Both anti-insulin 
and anti-penicillinase IgG fractions were required for 
the procedure above. The anti-insulin antiserum was a 
commercially available product obtained from guinea 

20 pigs. The rabbit anti-penicillinase ant i- serum was 
produced by injection (1 mg pure) penicillinase (in 
complete Freund's adjuvant (Difco)) into New Zealand 
white rabbits. (Booster injections were administered in 
incomplete Freund's adjuvant (Difco)) 2 and 3 weeks after 

25 the initial injection, and the rabbits were bled 1 week 
later. 

The IgG fractions were prepared from each 
immune serum by ammonium sulfate precipitation followed 
by DEAE-cellulose (Whatman, DE-52) chromatography in 

30 0.025 M potassium phosphate, pH 7.3, 17, glycerol. 
Fractions containing the bulk of the flow- through 
material were pooled, and protein was precipitated by 
adding ammonium sulfate to 40% saturation. The resulting 
pellet was resuspended in 1/3 the original serum volume 

35 of 0.025 M potassium phosphate, pH 7.3, 0.1 M NaCl, 1% 
glycerol, and dialized against the same buffer. After 
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dialysis, any residual precipitate was removed by centri- 
fugaticm. IgG fractions were stored in aliquots at -70°. 

Each IgG fraction was radioiodinated by the 
usual method of Hunter et al. , Biochem. J., 
91, 43 - 46 (1964). The 25 yl reaction mixture 
contained 0,5 M potassium phosphate, pH 7.5, 2 mCi 
carrier- free Na^i, 150 pg IgG and 2 ug chloramine T> 
After 3 minutes at room temperature, 8 pg of sodium 
metabisulfite in 25 \il PBS was added, followed by 200 ul 
PBS containing 27. normal guinea pig serum. The 125 I-labeled 
IgG was purified by chromatography on a Sephadex G-50 
column equilibrated with PBS containing 2% normal guinea 
pig serum. The 125i_i gG elution fraction was diluted 
to 5 ml with PBS containing 10%' normal guinea pig serum, 
filtered through a sterile Millipora VC filter C0.1 pm 
pore size), divided into aliquots and stored at -70°. 
The specific activies were 1.5 x 10 7 cpm/ug. 

This screening detected one clone of x 1776 
that synthesized and secreted a fused protein showing bath 
penicillinase and insulin antigenic determinant. This 
protein, recovered from the periplasm, mimics insulin 
in radioimmunoassays. DNA sequencing shows that this 
protein is a fusion between penicillinase and proinsulin, 
the two proteins being connected by 6 glycines between 
amino acid 182 of penicillinase (Alamine) and amino acid 
4, glutamine, of proinsulin. Thus a higher cell hormone 
has been synthesized In bacteria in an antigenically 
active form. 

It will be appreciated that the DNA sequence • 
for the desired eukaryotic cell protein can be inserted 
into a Hind II cut corresponding to the position between 
amino acids 101 and 102 of the protein for which this 
pBR322 plasmid codes, or into the Taq cut at the 
position corresponding to amino acid 45. In all cases, 
if the eukaryotic cell DNA is arranged in phase, by the. 
random addition of tails or by other procedures, It will 



Board Assigned Page #1 968 



0006694 



-n- 

be expressed as a fused part of the carrier protein; and 
the protein excreted from the cell. Furthermore, the 
sequence of the penicillinase gene, as it exists in this 
• plasmid, or in others, can be modified either by mutation, 
5 or by direct recombinant DNA techniques such as the 

insertion of DNA fragments at specific points within the 
gene, in such way as to insert new restriction cuts that 
are convenient for splicing. For example, the Rl cut 
on the plasmid pBR322 can be removed by mutation, and an 

10 Rl sequence inserted by ligation into the penicillinase 
gene. Although this might inactivate the gene, it 
would not interfere with the use of this region of DNA 
to synthesize a carrier protein. 

The segment of the penicillinase gene DNA 

15 between the code for amino acid 23 at the end of the 
hydrophobic leader and the code for amino acid 45 at 
the Taq cut for example, can be removed by nibbling 
back the DNA by a mixture of appropriate enzymes . One 
such mixture is the lambda exonuclease which will chew 

20 back the DNA strand from the 5' end, together with the 

enzyme SI, which will remove the single stranded overhang. 
Another such mixture is T4 DNA polymerase which will 
chew back the 3' end of one DNA strand together with SI, 
which again will remove the single stranded overhang. 

25 By controlled digestion the plasmid DNA molecule can 
be appropriately shortened to the fragment extending 
from the Rl cut to the point coding for amino acid 23 
or to other points on the hydrophobic leader sequence, 
and such a fragment can be fused to a similarly generated 

30 fragment containing the insulin sequence, chewed back 

enzymatically to a convenient initial point, presumably 
again, the point where the mature insulin molecule 
begins. These two fragments can be fused together, for 
example, by butt end ligation by the T4 DNA ligase, and 

35 that fusion inserted into the plasmid. That fusion 
produces a degenerate species of the carrier protein, 
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for which the carrier gene codes for only the E.' coli 
hydrophobic leader sequence and the eukaryotic cell 
gene provides the rest of the structural information. 
Although such construction can in principle be done 
5 exactly, in practice they will probably be done on a 
random basis, involving the splicing of a variety of 
gene fragments whose end points are in interesting 
regions, and examining the medium surrounding clones 
of bacteria transformed by the fused fragments to detect 

10 antigenic activity by an RIA such as the one described 
above, as evidence of protein synthesis. 

The procedure of the present invention is 
not restricted to the use of the E. coli penicillinase 
gene, but is applicable to the gene for any excreted 

15 protein carried on a multicopy plasmid or on a phage. 
It is not restricted to insulin, but can be used to 
find the expression of the fused protein of any DNA 
fragment of a virus or eukaryotic cell that carries a 
coding region that codes, when translated in phase, for 

2.0 antigenic determinants in the viral or eukaryotic cell 
protein. Thus if fragments of animal virus DNA are 
inserted into the Pst or Hind II site of the penicil- 
linase gene, some recipient bacterium will synthesize 
a fused protein which will be recognizable by using 

25 the RIA technique, employing antibodies specific to the 
viral antigen. This fused protein in turn can be 
purified and used to stimulate an antibody response in 
an animal or person, either for the production of 
antibodies directed at specific sites on the virus 

30 protein, or as vaccination against the viral antigen. 
The fused protein will provide helper determinants in 
such a vaccination, to aid the immune response, although, 
presumably, aggregated states of the fused protein would 
have to be used in a vaccine. The specific carrier 
35 proteins that would be used might be either the bacterial 
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proteins themselves or still further fusions between the 
bacterial proteins and other convenient sequences to 
provide useful helper determinants in carrier protein. 
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l * A me thod of making a selected protein or 

portion thereof by inserting DNA representing the selected 
protein or portion thereof into a bacterial gene, char- 
acterized by cleaving the bacterial gene for an extra- 
cellular or periplastic carrier protein, inserting into 
the cleavage site by a recombinant step a non-bacterial 
DNA fragment which codes for the selected protein or 
portion thereof, transforming -a bacterial host with 
the recombined gene,. and culturing the transformed 
bacteria to excrete the selected protein or portion 
thereof. 

2- The method according to claim 1, characterized 

in that the selected protein is a eukaryotic cell protein 
which contains a hydrophobic leader which is normally 
cleaved during excretion from the eukaryotic cell. 
3 ' The method according to claim 1 or 2, char- 

acterized in that the carrier protein is E. coli penicil- 
linase. 

4 - The me thod according to any of claims 1 to 

3, characterized in that the selected protein is insulin. 
5 " The method according to claim 1, character- 

ized in that the carrier protein is E. coli penicillinase 
and the bacterial gene is cleaved at the Pst restriction 
site. 



Board Assigned Page #1 972 



0006694 




CD OCJ 
CD OO 



CD 


(_> 


CD 


CJ 


CD 


c_> 


CD 


CJ 


CJ 


CD 




AA 






CJ 

<£ 


f2 


C_) 


CD 


CD 


CJ 




CJ 


■ £ 


<c 


CD 


OO 


CD 


ooO 


«=C 


1— 


CJ 




1 — 




CD 


CJ 


CJ 


CD 


<t 


I— 


CD 


CJ 


«C 


1 — 


»— 




*— 




CJ 

t 


CD 

•a: 


GT" 


CJ 


CD 


CJ 


t— 


<£ 


s 


H- 
OH 


1 — 


voca: 


AA 


TT 


(— 




<c 




CD 


CJ 


H- 


«C 


<C 


H- 


CJ 




1— 


% 


CD 


CJ 


h- 


<c 


<C 


i — 


<c 


t— 


1 — 


<x 


1 — 


<c 


CD 


CJ 


CD 


CJ 


«c 


J— 










t 




AT1 


H~ 


i— 


<C 


CJ 


CD 


CJ 


CD 


CD 


CJ 


o 


CD 


<c 




1 — 


<c 


<c 


*— 


CD 


o 


1 — 


<c 


CD 


CJ 


C_> 


CD 


1— 


<=><c 


* — > 


C* CD 




CD 


(J3 


CJ 


CD 


CJ 


CD 


CJ 


<C 


i — 




1 — 
1 — • 


CD 


CJ 




CD 


«c 


1 — 


CD 






1 — 
1— 


CD 


<_> 


TT 


AA 


CJ 




1 — 




-HI— 


<c 


OS <=C 


1 — 


SC 


h- 


C3 


cj 



CJ -*cD 
I — -t<X 







^? 




CD 


* — * 




f— 




1 — ■* 








o j 






NCD 


CJ 


O <o«ar 


1 — 


<** :*y=5 


1 — 




1 — 


«>CD 


CJ 


«l— 


<c 


£*CD 


CJ 


3 CD 


CJ 


<8h— 


■=c 


MCJ 


CD 


^.CD 


CJ 


.co 


CD 




t— 


3<5 


H- 




1— 


OCD 


CJ 


o<C 


1 — 


fiCJ 


OCD 


OiCJ 


00 CD 


<o<_> 


(N CD 


s*«sc 


f— 


iSCJ 


CD 


— <JH- 


<C 


"O 


CD 


tCD 


CJ 



«CD 
«l — 
*«C 
«CD 
t»l — 

t.CJ 
&.CJ 

ttl — 
•51— 

f,CD 
-<CJ 

OCD 

OCD 
O QCJ 
VO fcCJ 
O.CJ 
<3>CJ 
t,CD 
^CJ 

<sl — 

A.H- 

uh- 

<SCD 

«0<C 
3CD 

OCD 

31— 

Wl 

>^CJ 

<3CJ 

«t— 
H<C 
BCD 

•j<T 
»J— 

OCD 

WCD 

O CCJ 
ui <o«=C 

3CJ 
i»l — 
MCJ 

ni— 
«<C 

-aCCD 
3CD 

Q| 

»-»CJ 

OCD 
)CJ. 



"3* 



<«a: 
r,cj 

a>«C 
6-<H- 
— 
^ CD 
OCD 

MCO 

at— 

N-CD 
<M<C 

CCD 
^CJ 
O «<C 

"<CD 
a>— 

OCD 
3U3 



CCD 

OCJ 
o>- 
■o<C 

•^CD 

OCD 
tJH- 
NU 
<CCD 



CJ 

■a; 



CJ 



o CD 
o cd 

s 

CJ 
CD 



CJ 

t= 
CJ 
CD 
CD 
CD 
CD 
CJ 

o cp 
00 «c 

"5 

CJ 

I — 

CJ 

t— 

CJ 



CD 
CD 



CJ 
O CJ 
ID CD 
no 
H— 
CD 



CD 
<C 
CD 

<c 
I — 

CJ 
CJ 

<C 

CD 



eg 



<C 

CJ 
CJ 
CJ 

<c 

CJ 

I— 

CJ 

CD 

I— 

CD 
CJ 

<x 

CJ 
CJ 

00 

(M<C 

n< 
CJ 

I— 

CD 



CJ 

<c 

CD 
CJ 



►J I — 
a 1 — 

t>- CD 
O 3 CD 
OOI — 
riOt— 
D.CJ 
«) <C 
■cc CD 
C f— 
<a <f 
<C 
C CD 

^ «=c 

O iO 

t, t— 

IS CJ 

St 

ta cj 

M <C 

a: cj 
« <C 
I— 
H<I 

<3>CJ 

t. CD 
«?: cj 

ftjCJ 
t. CT3 
•q; CJ 
o a>| — 
a* <~s cd 

O CD 
3 CJ 

<8 I 

M CJ 



O CJ 
3 CD 

O CD 
C <C 

O CJ 
3, CD 
*-i CD 
O CD 
«CJ 

r-l tj 

-< CD 
Q.CJ 
<a <C 

"=C CD 



^ CD 
OjI— 
t, CD 
CJ 
O t, CJ 

00 <u cj 
to I 

3 <C 
•St ) 

^> I— 
a 1 — 

CD 

a cd 
~ CJ 

CD 
a>CJ 

CD 
O CD 



CJ 
3 CD 
» I— 
^ CJ 

O I— 
^ CD 
«j <£ 

M <C 
O ) — 

r- .-c 1— 
ft- V— 
U J— 
^: CJ 

u CJ 
<4 CD 
to <c 



CJ 
CJ 



CJ 
OH- 



CD 

s 

CD 

CD 
H- 
<Z 
I — 
CD 
CJ 
OCD 
00 CD 
-rCJ 

CJ 
CJ 
CD 

<c 

CD 
H- 
1 — 
CD 
CJ 



CD 
CJ 

ou 

VDCJ 

via 
CD 
CJ 
CD 
I — 
CJ 

CJ 

<c 

CJ 
CD 
CD 



CJ 

CD 
cj 
CD 
CJ 
CJ 

<c 

CJ 

CD 
CJ 

<c 

CD 
OCJ 

Mi — 

•Tl 

t— 

<C 

<x 

CD 
f— 
CD 
CJ 



o 

LJL 



CO 
N- 
03 

CD 
D5 
CD 
CL 

X3 
CD 



03 
< 



CD 
O 
CO 



0006694 



3<t 
<B I — 

•-51— 

ccj 
<o<c 

a (_j 

♦s4C_> 

■^CD 

ocd 
•>»cj 

■sCCD 
Ct— 
-CCJ 
t-t<£ 
o c<_> 
m <o<r 
»~i 

tu— 

«t«C 
*«CD 
M— 
«>CD 
«5<£ 

«J— 

*:<«. 
ccj 

U<r 
o> «=c 
t— 

H <C 
« CJ 

*-» CJ 

■«>: cr> 
a h- 
»~» cj 
"<c cd 
*. J— 

<s o 

to <r 
e> CJ 

2r>CO 

on- 
es a-d: 

«N <»| 

■ — I ^ — 

O CD 

o>«x: 

C CD 

«< <c 

O H— 
ii. CD 

c «=c 
c_> 

S-i <t 
+i CO 

« 1 — 
*>=c 

»-» CD 
O CD 

CD 
c CD 
* CJ 

Ei«=C 
a I — 
w I — 
^c_> 

O tt) |— 

»-t •«» <C 

» CD 

OCD 

•C CJ 

i-J c_> 
Of— 
N CD 
O "CC 
C CJ 

c«a: 

<8CJ 

tot— 
ccj 

a><C 
t-il— 
3CD 

OCD 



CD 



CD 
CD 
CJ 
CJ 
CD 
ou 
o <n 

vocD 

i— 

CO 



CJ 
<c 

CJ 
I — 
CJ 

-=c 
I— 

CD 
CD 



oo | — 
in CD 

CD 
CJ 
<C 
CD 
CJ 

-a: 

CJ 

i — 

CD 
CJ 



CJ 

o| 

VOCJ 



CJ 



CJ 

<c 
I — 

CD 
CJ 
CJ 

•a: 

I — 
CJ 
CJ 
CD 

o|— 
ir><£ 

CD 

<c 

I — 
CD 

CJ 

f— 



CD 

«=c 

CJ 
I — 
O CD 
<N CD 
tr>( — 
CD 

<c 

CD 
I— 

«=C 

CJ 



act 

OCD 
CI— 

^ 
3CD 
Wl — 

MCJ 

acD 
«^><C 

OCD 
OCD 
CCJ 

q<cj 

OCD 
DJLD 
CCD 
E-,1— 
Crf— 
CCD 
"CCJ 

<s«a: 

"SCD 
O 3f— 
VO Q| — 

•H MCJ 

OJCJ 
CCD 

"ccj 
ch- 

•CCJ 

61 «=c 
01— 

ia>CD 
03 1 — 
M<C 
&:CJ 

oj— ■ 
<o«a: 

-5CD 
3/-D 
t-»CD 

ocd 
«CD 

CO 
B< 

OJCJ 

v<C 

*!CJ 
O 3CD 
ir> isl — 



&.( — 
01— 
t-*CJ 
"JCCD 
t.CJ 
J=CJ 
E-«<a: 

a<C 
»i — 
mj 

3CD 

OCD 
to CD 

OCD 
t,CJ 
ft,CJ 

•~»CD 
OCD 

t-i CD 
CJCD 
O too 
•«r m|— 

C CD 
-CCJ 

j;CJ 

k-ci: 

3 CD 
oil — 
^ CJ 



CJ 



CJ 

<x. 

CD 
CJ 
H— 
CJ 
CJ 
OCD 
OCD 
C^l— 
1— 
CJ 
CJ 
CJ 

CJ 

<c 

I— 
CJ 

<c 
«c 

CD 
CD 
CJ 
CD 

o<c 
00 CD 
us I 

J— 

CJ 

<c 
h~ 

CD 
<C 
I — 
CJ 
CJ 
CJ 
CJ 
CJ 



o 1 

UJCD 
«>l— 
CD 
CJ 

CD 
CJ 
CD 
CD 



CD 



TO 
VOCJ 



CD 
CD 
I — 
CJ 
CJ 

I— 

CJ 
CJ 

CD 
<3Z 



(N CD 
«J I — 
CJ 
<C 
CD 

<x. 

CD 



-5 


TT 


^3 CJ 


CD 


OlCD 


CJ 


t»CD 


0 


«<CJ 


CD 


U CJ 


CD 


0>CJ 


CD 


<0l — 
O Ol — 




O t^>CJ 


CD 


04 ^CCD 


CJ 


a<C 


1 — 


«K— 


<c 


MCJ 


CD 


r<l — 


o<a: 


JJCJ 


OCD 


Eh<C 


CO ( — 






^ CJ 


CD 


3<C 




»>— 




MCJ 


CD 


3<r 

M<C 


TT 


OCD 


CJ 


*CJ 


CD 


i~»CD 


CJ 


OCD 


CJ 


U\— 


<C 


A^CJ 


CD 


h<r 
3^ 


TT 


^1 — 


oS 


3<C 


col— 


<»l — 


t~<c 


^CJ 


CD 


0 to«H 


H— 




t~ 




t— 


tJjCJ 


CD 


Is CD 


CJ 


n;cj 


CD 




CJ 


tt^— 


<C 


— 




J^CD 


3 


JCO 


CD. 




TT 


•UCJ 


CD 




TT 


t->CJ 


O CD 


■=ccr> 


VOCJ 


+1 CD 


r-CJ 


«l — 




S<C 


1 — 




CD 


vf-j 


CJ 


0<C 


1 — 


t-iCJ 


CD 


•^CCD 


CJ 


Ol — ■ 


<Z„ 


i, CJ 


CD 


O.CJ 


CD 


O +»CD 


CJ 


00 <»| — 


<c 


r-< S<C 




*<CD 


CJ 


^;cj 
6-.«aC 






S.CJ 


O CD 


rf!CJ 


•*CD 


E-i<C 




oy 


CD 


»<c 


1 — 


"CCD 
crJ — 


AC 


CCD 


CJ 


-<CJ 


CD 


3 CD 


CJ 




1 — 


OCD 


CJ 


n/uj 


CD 


BCC 


1 — ■ 


^CD 


CJ 


KCJ 


CD 


<o<a; 


1 — 




h— 




t— 


CCJ 


CD 


IX, CJ 


O CD 




tN| — 


<~*i — 






f— 


O OCJ 


CD 


mo 


CD 


r-l ^tLD 


CJ 



UCJ 





<N 


Ol— 




t CJ 


^ c3 


CD 




f— 


1 — 




AC 




1 — 




CJ 




CJ 


^0 1 — " 


<c 


»r, - 

^ <t 


t "" 


l^h- 


<c 


c — > 


CD 


«S 

0<3. 


i — 


»^» CJ 


CD 


^ CD 


CJ 


CD 


OCJ 




OH 


O CJ 


OCD 


Ul — 


H< 


QJCD 


CJ 


to«a: 


H- 


SiCD 


CJ 




CJ 


OCD 


CJ 


•CCJ 


CJ 
CD 




t— 


CCD 


CJ 


j:cj 


CD 






O CCJ 


CD 




1 — 


M E-al — 


<c 


»CJ 


CD 






N«a; 


I — . 


t-*t— 


oS 


OH— 


CO «5 


t»-CD 






j 


Ol — 




ikCD 


t_> 


tacj 


t-O 




«c 




|— 


&J — 


<x 


CCD 




•^CJ 




ccj 


( rp 


»CJ 




tot— 


<c 


OCJ 




ccj 




RiCJ 




03 CD 


t_> 


a>«a: 


I — 


4< 

a>l — 


ot— 


MO 




OCD 




0 — 




in oxc 




M nccD 


CJ 




1— 


CCJ 


CD 


&,<_> 


CD 


3>CD 


CJ 


MCD 


CJ 


OCD 


CJ 


3CD 


CJ 


«l — 


<c 


*-3CJ 


CD 


o<c 


1— 


*--»CJ 


CD 


^CD 


CJ 


n err" 


h- 


-><_> 


O CD 


^?CD 


■O-CJ 


(SI 






m ^ 




1 — 


t»CJ 


CD 

<x 


H<a: 

a*— 


AT 


«ej 


CJ 


OCD 


CJ 


OjCJ 


CD 


CCD 


CJ 


^CJ 


CD 


O C(— 


<a: 


•V 01CJ 


CD 


<N Vll — ' 




->JLD 


CJ 


»-*CD 


CJ 


OCD 


CJ 


Cnl — 


o< 


CCD 


r-) CJ 


CJ 


o»CD 



CM 

O 
LL 



Board Assigned Page #1 974 



0006694 





O | 


cd 


O C-J 


£ 






( 1 ^-J- 




<C 




<C 


<_> 


CD 


«c 


1 — 


1— 


<c 


<c 


1— * 




<c 


fee 


. h— 


h- 


<C 


<c 


1 — 


c_> 


CD 


1 — 


<C 




CD 


<c 


J— 


1 — 


<c 


1 — 


<c 


1 — 


0<C 


CO 


oocj 


AA 


ol — 
iH | — 


c > 


CD 


t_> 


CD 


<C 


I— 


CD 


<_> 


<C 


t— 


c_) 


CD 


h- 


<c 


CD 


<_) 


I— 




C_> 




«=C 




<c 


h- 




<C 


gad 


c_> 


CO S,CD 


c_> 


M t.1 — 


<C 


to J 


0<C 


-.»<=; 


<£>|__ 


a:c_j 


OCD 


» CD 


^c_> 


5$ 


TT 


<ul 


<e 


1 — 


<r 




i — 


3CD 


c_> 


<ul 


<c 




CD 




t— 


CD <_) 


CD 


IQ| 


<c 


o «o 


CD 


00 WO 


CD 


^D 


C_J 


— 


<C 


to CD 


CJ 


dCD 


oo 


<»«=C 




t~»J — . 


o«a: 


»~i<C 


r-ll 


3 CD 


CJ 


>-i«=C 




<JCD 


G 


at— 


«a: 


MO 


CD 


"*CD 


c_> 


<»«_-> 


CD 


*~>H- 


<c 


>~i<C 


1 — 


CCD 


o 


*-»«cC 


1— 


C5<_> 


CD 




t— 


*<co 


CJ 


n<C 


t— 


KH 


c£ 


«o<C 


1 — 




h— 



CO 

6 

LL 



LO 
N- 
03 

CD 
CO 
CD 
CL 

X3 
CD 



03 
< 



CD 
O 
CO 



TAB GG 



Board Assigned Page #1 976 



JAN 



29 2007 

"1 ■ 



16 : 47 FR CISTI ICIST 



613 952 9303 TO 12122308888 P. 03/2 



Ann. Rev. Microbiol 1976. 30:507-33 

Copyright e 1976 by Annual Reviews Inc. All rights reserved 



GENETIC MANIPULATION OF 
MICROORGANISMS: POTENTIAL 
BENEFITS AND BIOHAZARDS 

Roy Curtiss III 

Department of Microbiology, Institute of Dental Research, Cancer Research and 
Training Center, University of Alabama, Birmingham, Alabama 35294 



CONTENTS 

INTRODUCTION 507 

CURRENT STATUS OF TECHNOLOGY 508 

POTENTIAL BENEFITS 509 

Introduction 509 

Acquisition of Basic Knowledge 509 

Applications of Technology , 5 \Q 

POTENTIAL BIOHAZARDS 511 

Introduction 5H 

Factors Affecting the Probability of Escape of Chimeric Microorganisms 512 

Factors Affecting the Probability of Survival and Perpetuation of Recombinant 

ONA 514 

Factors Affecting Probable Consequences of Escape and Survival of Recombinant 

DNA Molecules .... . 518 

Estimating the Probability of Manifestation of Adverse Consequences by 

Recombinant DNA Molecules , 519 

INTENTIONAL RELEASE OF GENETICALLY ALTERED MICROORGANISMS 527 



For Beneficial Purposes 

For NonBeneficial Purposes 

COPING WITH POTENTIAL BIOHAZARDS 

DATA NEEDED TO EVALUATE POTENTIAL BIOHAZARDS OF CHIMERIC 

MICROORGANISMS 

CONCLUDING REMARKS 



INTRODUCTION 

It has recently become possible to enzymatically couple viral, procaryotic, or 
eucaryotic DNA to bacteriophage or plasmid-cloning vector DNA and to introduce 
these recombinant DNA molecules into bacteria to clone and ultimately to allow 
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study of the expression of the foreign genetic information (15, 37). The potential 
applications of these techniques in obtaining basic information about life processes 
and in attempting to contend with some of the major problems facing our society 
stagger the mind (48, 74). On the other hand, the use of these newly acquired 
techniques for the genetic manipulation of microorganisms and their ultimate adap- 
tation for the genetic manipulation of eucaryotic organisms is also associated with 
potential risks. In this review, I consider only briefly some of the potential beneficial 
applications of these techniques; instead, I dwell more comprehensively on the 
potential biohazards that are of more imminent concern and the means to contend 
with them. Considerations are restricted principally to the cloning of DNA in 
microorganisms (i.e. procaryotic organisms for which cloning techniques are al- 
ready developed and other free-living, generally unicellular, lower eucaryotic organ- 
isms, such as algae, fungi, and protozoans, for which cloning techniques are not yet 
developed). Such genetically altered microorganisms that possess genetic informa- 
tion from species that do not normally exchange genetic material with the host 
microbe are referred to as chimeric microorganisms. 

CURRENT STATUS OF TECHNOLOGY 

Cohen (15) has recentl;- reviewed the historical developments leading to the ability 
to clone foreign genes in the bacterium Escherichia coli K 12. In order to prepare 
recombinant DNA molecules in vitro, DNA is isolated from an organism of choice, 
either with or without suitable enrichment for specific gene sequences, and then one 
of two methods is used to construct recombinant DNA molecules. In one procedure, 
the DNA is fragmented either enzymatically or by shearing, then treated with an 
exonuclease that digests a single DNA strand in the 5' to 3' direction; then single- 
stranded homo-deoxypolymeric sequences are added to the 3' single-stranded ends 
of the molecule by using a suitable deoxyribonucleotide triphosphate (dXTP) spe- 
cies and terminal deoxynucleotidyl transferase (42, 51). The cloning vector DNA 
is similarly cleaved and treated with exonuclease, then complementary homodeoxy- 
polymeric sequences are added by use of terminal transferase. The two DNA 
fractions are permitted to anneal and then may be treated with exonuclease III, 
DNA polymerase I, and polynucleotide-joining enzyme (DNA ligase) to form cova- 
lently closed circular recombinant DNA molecules (42, 51). In another procedure, 
the DNA is fragmented by using a restriction endonuclease which generates short 
complementary cohesive ends at both termini of the DNA duplex (35, 54, 69). The 
cloning vector DNA is treated with the same enzyme, the two preparations are 
mixed to permit annealing, and ligase is added to yield covalently closed circular 
recombinant DNA molecules (18, 79). Recombinant DNA molecules formed by 
either method are then used to transform a suitable bacterium (17, 52) to yield either 
transformants inheriting the chimeric plasmid or phage-contining cloned DNA 
segments. A variety of plasmid cloning vectors (18, 39, 79, 82) and bacteriophage 
\ variants (29, 59, 66, 81) have been developed that possess features that facilitate 
their use for cloning foreign DNA. Most research to date has utilized E coli K 12 
strains for the cloning of DNA from a staphylococcus plasmid (12), Xenopus laevis 
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(58), sea urchins (45), Drosophila melanogaster (32, 80, 81, 83), mouse'mitochondria 
(13), yeast (J. Carbon, personal communication), and undoubtedly a greater diver- 
sity of other organisms by the time this article goes to press. The techniques can 
be readily adapted to use phage- or plasmid-cloning vectors that are able to replicate 
in procaryotic hosts other than E. coli. Sufficient information is also at hand to 
enable the use of various plant and animal DNA-containing viruses as vectors for 
cloning DNA in eucaryotic host cells infectable by the virus in question. Although 
so far unrealized, it is possible that plasmid DNA obtained from eucaryotic cell 
organelles will be usable as cloning vectors for eucaryotic cell research. 



POTENTIAL BENEFITS 
Introduction 

The construction of chimeric microorganisms that possess and express specific 
genetic information from species that do not ordinarily exchange genetic informa- 
tion with the bacterial host presents certain problems which need to be solved. 
Although there should be no difficulty in observing expression of genes from procary- 
otic organisms of bacterial plasmid or phage DNA, it is unlikely that eucaryotic 
genetic information would be expressed often in procaryotic hosts. This problem 
makes it difficult to use random DNA segments from a eucaryotic organism to form 
recombinant DNA molecules (i.e. a "shotgun" experiment) and then to select a 
bacterial transformant possessing the desired eucaryotic genetic information. At 
present, such desired transformant clones can be obtained by starting with a DNA 
that has been either enriched for a specific DNA sequence or copied from a purified 
RNA (i.e. cDNA) and then recognized by using specific molecular probes, such as 
a purified RNA species (1 1). Alternatively, the starting material can be unfrac- 
tionated DNA, and the desired clone can be recovered by using a statistical enrich- 
ment procedure in conjunction with the purified RNA probe (16). Thus, in order 
to facilitate cloning of any eucaryotic gene, it is necessary first to understand the 
basic differences between procaryotic and eucaryotic transcription and translation 
mechanisms and then to either couple eucaryotic genetic information with the 
appropriate start and stop signals of procaryotic origin or endow the procaryotic 
host with the ability to permit faithful transcription and translation of eucaryotic 
information. Endeavors are already in progress to couple specific procaryotic opera- 
tor-promoter sequences onto eucaryotic DNA sequences, and it can be anticipated 
that the intensive research in this area will eventually solve these problems. It is also 
evident that the use of chimeric microorganisms to produce an abundance of certain 
desired gene products will mean that the host strains must be manipulated to liberate 
the product into the growth medium without cell lysis. 

Acquisition of Basic Knowledge 

The use and refinement of the techniques for constructing chimeric microorganisms 
would provide a wealth of knowledge that cannot easily be obtained by other means 
(48, 74). For example, the ability to clone specific eucaryotic DNA sequences would 
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permit characterization of eucaryotic chromosome structure, including the basis for 
maintenance of reiterated sequences, the distinctions between the DNA in eu- 
chromatic and heterochromatic chromosome regions (if any), and the eventual 
construction of complete chromosome maps. These techniques can also be used to 
deduce the mechanism for generating the potential for antibody diversity as well as 
other aspects of genetic regulatory phenomena, even in the absence of the ability to 
express such eucaryotic DNA. When the problem of expression of eucaryotic DNA 
in procaryotic organisms is solved, it would be possible to study gene regulation and 
do complementation analyses and fine-structure mapping of genes. These techniques 
would be useful in characterizing viral gene functions relevant to understanding 
infection, latency, and oncogenesis. Procaryotic genetic information from organisms 
that have unusual growth properties and that lack well-defined systems of genetic 
analysis can also be studied in a genetic background, which would permit all the 
tools of modern K coli genetics to be used to elucidate the basis for these important 
metabolic activities. These new types of information on the structure, expression, 
and regulation of genetic information from a diversity of organisms will facilitate 
our understanding of both normal and abnormal functioning of life processes. The 
application of such new knowledge will have a profound effect on life and living and 
will undoubtedly lead to applications of the technology to the solution of problems 
that are completely unforeseen at present. 

Applications of Technology 

Although one cannot predict all applications of the ability to genetically engineer 
microorganisms, some applications are readily apparent, and others, although more 
speculative, can be predicted. Certainly, chimeric microorganisms could be used to 
produce products useful in medicine, agriculture, commerce, and industry. In the 
area of public health, I envision the construction of strains for the development of 
safe, effective vaccines to cope with viral, bacterial, mycotic* and parasitic infections, 
I as well as the potential use of microbes to directly contend with obnoxious patho- 
gens, insect pests, etc. The continued production of a sufficient quantity of high- 
quality foodstuffs to feed the population of the world is one of society's major goals, 
but despite considerable progress in this area, we do not appear to be any closer to 
attaining this goal. Therefore chimeric microorganisms will undoubtedly be used to 
produce large quantities of high-quality animal proteins inexpensively, for use as 
diet supplements, or, alternatively, to genetically manipulate the microorganism so 
as to eliminate problems associated with endotoxins and high concentrations of 
nucleic acids, thereby rendering them suitable sources of nutritive, nontoxic single- 
cell protein. The cloning of specific DNA sequences in microorganisms could also 
be used to genetically manipulate plant and animal species of agricultural impor- 
tance to increase nutritive value and/or reduce reliance on expensive fertilizers or 
foodstuffs. Furthermore, these technologies would most likely be used to contend 
with specific types of environmental pollution and in some instances to degrade 
accumulated wastes through the production of usable, inexpensive energy sources. 

In my view, the discoveries leading to the ability to couple genes from organisms 
that do not normally exchange genetic information and to introduce them into 
microorganisms where the foreign genes can be cloned and ultimately expressed 
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represents one of the most profound series of scientific discoveries in the modern 
era of science. Although I am extremely optimistic about the benefits that will 
accrue from this research, it should also be evident that the utilization of these skills 
to synthesize organisms that probably never existed previously is not without haz- 
ard. Certainly, the growth of large volumes of chimeric microoganisms, with the 
potential for their inadvertant release and even more their design for intentional 
release into the biosphere for a specific purpose, may have unpredictable conse- 
quences for the quality of life of a diversity of species inhabiting the biosphere. 



POTENTIAL BIOHAZARDS 
Introduction 

The potential biohazards associated with recombinant DNA molecule research led 
to a series of discussions in 1973 (73) and 1974 that culminated in the publication 
of a letter by a NAS-NRC committee chaired by Paul Berg (8). This letter briefly 
assessed some of the potential risks and requested that scientists defer certain types 
of experiments and use considerable caution and good judgment in the performance 
of other types of experiments. This letter initiated extensive discussion among 
members of the scientific community that has gone on unabated since that time and 
has led to the convening of an international meeting at the Asilomar Conference 
Center in Pacific Grove, California, in February 1975 for the express purpose of 
considering all aspects of recombinant DNA molecule research and formulating 
provisional guidelines for the performance of this research (9). These topics have 
also been discussed and debated at a series of other scientific meetings held through- 
out the world since that time. These discussions also led to the formation of a 
National Institutes of Health Recombinant DNA Molecule Program Advisory 
Committee to establish more formal guidelines for US investigators. Similar com- 
mittees are at work in other nations to devise comparable guidelines (2) so that this 
research can be conducted worldwide in a manner that would ensure maximum 
benefits while minimizing, if not precluding, manifestation of biohazards. 

During these discussions and debates, some members of the scientific community 
have contended that the construction of recombinant DNA molecules and their 
introduction into microorganisms that might be inadvertantly released from the 
laboratory environment would be without great potential biohazard. One such 
argument, which has considerable merit from a genetic standpoint, is that a bacterial 
cell endowed with additional traits that are without benefit to the cell would not 
compete well with cells that are not burdened with the maintenance and expression 
of this additional genetic information. The limitation of this argument is that there 
is no accurate way to predict whether a new trait will or will not confer a selective 
advantage to the cell in any of the various ecological niches in which the cell might 
find itself. 

Another oft-heard argument is that such composites of genetic information have 
undoubtedly been tried during the course of evolution, with the consequence that 
such combinations have not been successful and therefore have not survived. Al- 
though I would agree that such combinations of traits might have occurred at some 
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time during the course of evolution, I reject the concept that these combinations of 
traits, when introduced into the biosphere as we know it now, would not pose a 
problem. Certainly, the earth has also evolved and conditions that now prevail are 
quite different from those present at the time when such combinations of traits might 



Because there is a conspicuous lack of information about whether some of the 
surmised potential biohazards are real or not, some scientists have also taken the 
position that one should not worry about these potential biohazards until data are 
available to substantiate such speculations. Although I agree that it is uncharacteris- 
tic for scientists to fear the unknown or to postpone utilization of an available 
procedure to help deduce the secrets of life, it must be noted that chimeric microor- 
ganisms and the recombinant DNA which they contain have the potential for 
perpetuation and thus for irreversible fixation in living organisms. Since scientists 
are citizens of the world, it would therefore be totally irresponsible for any member 
of the scientific community to exercise any less than the required or advisable set 
of precautions for the performance of any given experiment in regard to the potential 
biohazards associated with it. I therefore consider it much more prudent to assume 
the worst and to investigate the existence and severity of postulated biohazards 
before relaxing the various safeguards for performing experiments on construction 
and use of microbes with recombinant DNA molecules. 

In evaluating the potential biohazards associated with the construction of a 
chimeric microorganism, one must consider the probability of its escape from the 
research laboratory environment, the probability of survival and perpetuation of the 
recombinant DNA in nature, and lastly the probability that, if recombinant DNA 
molecules survive and are perpetuated in nature, adverse consequences could be 
manifested. The factors contributing to each of these probabilities and the possible 
variables need to be considered before attempting to assess what these probabilities 
may be. 

Factors Affecting the Probability of Escape of Chimeric Microorganisms 

personnel The first line of defense against the inadvertent release of a chimeric 
microorganism or of a viral vector carrying foreign DNA has to be the personnel 
who work in or around a laboratory in which such research is being conducted. All 
those actively engaged in the research must be both well trained in the use of 
required procedures, instruments and facilities, and meticulous practicioners of 
these skills. Furthermore, other laboratory workers such as dishwashers, media 
preparers, etc, and even occasional or intermittent laboratory visitors such as secre- 
taries, custodians, equipment repair technicians, plumbers, electricians, etc, must all 
understand the nature of the experiments being conducted, and especially the nature 
of the associated biohazards, real or potential, so that any entry by these individuals 
into the research laboratory and any of their actions will not lead to the escape by 
chimeric microorganisms or viral vectors. The procedures and precautions all these 
individuals employ should be commensurate with the risks and should follow what 
is ordinarily associated with research on pathogenic microorganisms (14, 19, 26, 38, 
47, 49, 60, 61, 70, 78)! 



have been unsuccessfully tried. 
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0 f Unfortunately, this is easier said than done. For example, although I am familiar 

a with most of the procedures for working with virulent pathogens and although our 

re laboratory sterilizes or decontaminates everything prior to disposal or dishwashing, 

nt many years of working on the genetics of E. coli K12 have led to the adoption of 

procedures that are totally unsuitable for working with chimeric microorganisms. 
^ e For example, dishwashers do not automatically discard all bottle or tube caps 
ne lacking liners or screw cap tubes with chipped lips; screw cap centrifuge bottles are 
re never inverted with fluid therein to check for leaks; gaskets on centrifuge rotors are 
i s . not routinely replaced so as to maintain a good seal; and, although methods for 
,| e correct sterilization of bacteriological loops and needles are known, they are seldom 
, r _ practiced. Because of these factors and because many individuals who are now 
or interested in undertaking experiments on the formation of recombinant DNA mole- 
jt s cules have never had any training in microbiological skills, there is some doubt about 
, er whether the experimentalist will actually adopt procedures that would effectively 
ie t eliminate any inadvertent escape by chimeric microorganisms. Even when appropri- 
i a l ate procedures are employed, laboratory workers are often known to acquire anti- 
ng body titers against the virus or bacterial species with which they are working — a 
d s good indication that laboratory workers will be exposed to chimeric microorgan- 
3n isms, potentiating the possibility of the workers becoming the vectors for carrying 

microorganisms out of the laboratory environment. 

a Another personnel-related problem concerns the probability of contaminating 

he cultures used to construct or containing chimeric microorganisms. In either case, 

he recombinant DNA molecules, if inadvertently permitted to escape from the labora- 

TA tor y environment, may be introduced into microbial hosts that possess a higher 

be probability for survival and potential for subsequent transmission of recombinant 

)l e DNA than the intended host. In this regard, it is also evident that the likelihood 

[es of cultures used for recombinant DNA molecule research becoming contaminated 

with robust permissive strains would be increased if the same laboratory group is 

actively engaged in research with the microbial species and/or vectors being used 

for cloning. 

ric FACILITIES Considerable knowledge is available about the use of physical con- 

iel tainment equipment and facilities to reduce the probability of escape of pathogenic 

UI microorganisms and viruses from the laboratory environment (27, 38, 47, 6 1). These 

of include using microbiological safety cabinets, laminar flow biohazard hoods or 

of negative-pressure laboratories, and special traps on drain lines, vacuum lines, or 

lia equipment in containment cabinets for any procedure likely to produce aerosols. It 

re- is also possible to install autoclaves in the walls of such facilities to make it possible 

all to sterilize material prior to its removal from the room in which a given experiment 

ire is done. In addition to the absolute reduction in dissemination of microorganisms, 

als the use of such facilities and equipment has a psychological effect in causing labora- 

by tory workers to employ procedures commensurate with the risks associated with the 

;se task at hand and to thwart entry of other personnel into the room in which such 

iat experiments are done unless direct permission is given by someone in authority. The 

J 8, probability of escape of microorganisms or viral vectors containing foreign DNA 
would therefore decrease as the physical containment provided by the equipment 
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and facilities used increased. It should also be noted that the safety "barriers asso- 
ciated with any type of physical containment system would be reduced by the 
absence of appropriate measures to control insects and/or rodents in or around the 
laboratory. 

emergency OR accident plan Laboratory accidents are a reality of life and 
the adoption of an appropriate plan of action in the case of an accidental spill of 
recombinant DNA or a viral vector or microbe that contains it can markedly reduce 
the probability of its escape from the laboratory environment. Thus procedures to 
cope with such accidents should be developed and available for use prior to com- 
mencing recombinant DNA molecule experiments. 

Factors Affecting the Probability of Survival and Perpetuation 
of Recombinant DNA 

NATURE OF MICROBIAL HOST STRAINS AND CLONING VECTORS In terms of 

a microbial host strain to be used to introduce recombinant DNA, one can consider, 
in order of decreasing potential for survival, the use of recently isolated wild-type 
strains, laboratory-attenuated wild-type strains or their derivatives, and strains 
whose ability to survive and/or multiply is solely dependent upon uniquely defined 
laboratory conditions that are unlikely to be encountered in nature, especially in the 
ecological niches that are normal habitats for the microbial species in question. The 
survival of the host employed and the possible perpetuation of recombinant DNA 
by transmission to other microbes also depends on the routes of escape from the 
laboratory. For spills and aerosolization, resistance of the host to dessication, disin- 
fectants, UV irradiation, etc, is important, and the potential of the host to form 
cysts, spores, conidia, etc, is relevant. For inadvertent release by disposal down the 
drain, one must consider the host's survival potential in an aqueous (polluted) 
environment. Since contamination of personnel is the most likely means of escape 
for the microbial host in question, the likelihood of its survival on clothing, on skin, 
in the pharynx, in the oral cavity, during passage through the stomach and the 
remainder of the gastrointestinal tract, and ultimately in the sewers and during the 
local sewage treatment processes must be considered. In all these instances, we must 
examine the probability of the host surviving long enough by any of these potential 
means of escape to find a suitable environment, either for its multiplication or where 
other robust microbes present might act as recipients for receipt of the recombinant 
DNA. 

In terms of plasmid-cloning vectors, the use of nonconjugative 1 plasmids affords 
more containment than use of conjugative 1 plasmids, and nonconjugative plasmids 
that are poorly mobilizable 1 by conjugative plasmids that might be transferred from 
microbes in nature to the host containing recombinant DNA are better than noncon- 
jugative plasmids that are readily mobilizable. If the plasmid-cloning vector is 

'A nonconjugative plasmid is unable to promote its own transfer by conjugation, whereas a 
conjugative plasmid can promote either conjugational transfer of itself or, through a process 
termed mobilization, the transfer of chromosomal and/or nonconjugative plasmid DNA (63). 
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genetically manipulated so that its replication depends on the host strain or so that 
its transmission to other microbes would be lethal to them, then one need be 
concerned only with the survival of the host carrying the recombinant DNA. One 
must also consider whether the plasmid-cloning vector possesses traits that confer 
a selective advantage on the host strain that would cause it to be more likely to 
survive. 

Similar considerations can be given for the survival of viral cloning vectors, with 
wild-type vectors more likely to survive and perpetuate cloned DNA segments than 
those viral vectors that depend solely on a laboratory-adapted strain for multiplica- 
tion. If the viral vector can establish a carrier state in the host microorganism, one 
must consider the survival potential of the host by the various routes of escape 
discussed above. Likewise, if the viral vector does not solely depend on the host 
strain for its replication, then one must consider the virus's potential to survive by 
any route of escape until it encounters a sensitive permissive host cell. 

ECOLOGICAL NICHES OCCUPIABLE BY ORIGINAL AND CHIMERIC HOSTS 

The probability of survival of escaped chimeric microorganisms is also affected by 
the natural ecological niches of the host microorganism, the contributions of the 
foreign DNA to alterations in the potentially occupiable ecological niches, and also 
the distance between the occupiable ecological niches and the laboratory setting in 
which the experiments are performed. For viral cloning vectors, a principal factor 
governing survival and perpetuation of escaped cloned DNA is the proximity of the 
laboratory environment to the ecological niches in which potential sensitive permis- 
sive hosts for the viral vector might exist. In these regards, one must also consider 
the alterations of the biosphere by current civilization that permit given microbial 
types to exist in new ecological niches. For example, K coli and other enteric 
microorganisms that have probably always been prevalent in the intestinal tracts of 
warm-blooded animals are now found to occur in cold-blooded animals, and higher 
invertebrates, and certainly in sewers, streams, rivers, lakes, and estuarine waters, 
as well as on land in both urban and agricultural environments (20). Thus the use 
of E. coli and other enterics for the formation of chimeric microorganisms places 
normal ecological niches for the organism in or near the laboratory setting. It should 
therefore be evident that the use of a microbial host that finds humans to be a 
suitable ecological niche provides the greatest likelihood of successful escape, 
whereas microorganisms or chimeric microorganisms that find soil and/or plants 
as normal ecological niches have a somewhat reduced probability of being carried 
from the laboratory environment to their normal ecological habitat. The use of 
marine microorganisms as hosts for recombinant DNA would only seem to pose a 
hazard if the laboratory were located adjacent to a marine environment, and simi- 
larly the use of microbes that might occupy a very restricted ecological niche (such 
as hot springs) would be least likely to ever get from the laboratory environment 
to their normal ecological habitat. 

TRANSMISSIBILITY It is now realized that gene flow among microorganisms in 
nature is far more common than previously thought; thus one must bear in mind 
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the likelihood of a cloned DNA fragment being transferred from a viral-cloning 
vector or chimeric microorganism to some other vector or microorganism in nature 
or in the same laboratory that has a higher probability of survival. Most, if not all, 
species within all procaryotic genera are lysogenic for one or more temperate phages 
(1, 7, 34). Since most temperate phages can probably act either as generalized or 
specialized transducing phages, it is likely that transduction in nature can lead to 
transmission of genetic information from one procaryotic microorganism to another 
in the same or a closely related species. The frequency with which this occurs is 
likely to be low, although it has only been measured in one instance (62). The 
probability for transductional transmission of recombinant DNA from the host 
microbe to other microorganisms in nature can be reduced, if not abolished, by using 
a nonlysogenic host that is also resistant to transducing phages that are known to 
infect the host strain. This may be easier said than done, however, because cell 
surface changes that lead to resistance to known transducing phages are likely to 
result in sensitivity to other phages that have not been studied and that may be 
capable of transduction. 

Plasmids are now known to occur in at least 40 bacterial genera (31, 46, 55, 56, 
68), and it is probable that such plasmids will be found in other bacterial genera as 
they become better studied. Although conjugation, which is promoted by conjuga- 
tive plasmids, is best known among gram-negative bacterial genera, the occurrence 
of some type of conjugational gene transfer in mycobacteria, Nocardia, Actinomyces, 
and Streptomyces and the newly discovered presence of conjugative plasmids in 
streptococci (43) causes me to believe that conjugative plasmids are likely to be 
present in a diversity of other gram-positive genera as well. Wild-type conjugative 
plasmids, as found in nature, are usually repressed with respect to expression of the 
donor phenotype, such that only a small percentage of donor cells are capable of 
plasmid transfer at any given time (31). They are therefore transferred to recipient 
cells of the same strain at frequencies of 10 -3 or lower per donor cell. These frequen- 
cies may be further reduced by entry exclusion or incompatibility (63) if the recipi- 
ent also possesses a homologous resident plasmid, by restriction (5), by differences 
in the cell surface which inhibit the formation of mating unions (22, 30, 57, 67, 75), 
and by differences in the ability of given strains to establish or maintain a plasmid 
once transferred into their cytoplasm. These barriers may reduce conjugative plas- 
mid transfer by another several orders of magnitude or abolish it altogether. 

Although conjugational gene transfer in K coli occurs at equal frequencies under 
aerobie and anaerobic environments (77), there are other environmental factors 
which may act to further reduce conjugational plasmid transfer. I have found that 
the yield of plasmid transconjugants decreases by the square of the dilution of the 
parental densities below 10 8 cells/ml, thus transconjugants inheriting plasmids are 
reduced by three to four orders of magnitude in reducing mating titers from 10 s to 
10 6 cells per ml, the latter being the usual titers for E. coli in the intestinal tracts 
of humans and animals (20). Temperature also exerts an influence on conjugational 
plasmid transfer, with the optimum for F, I, and N group plasmids commonly found 
in enterics being near 37°C, with essentially no transfer below 28°C since donor cells 
are unable to express the donor phenotype at these lower temperatures (10, 55). 
Thus transfer of those plasmid types in nature probably occurs in intestinal tracts 
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, ning and in fermenting excreta (i.e. manure piles, sewage treatment plants, etc) and not 
iture in streams, rivers, or lakes. Conjugative plasmid types commonly found in soil 
lt a jj bacteria or in fish pathogens, for example, are likely to be transferred at higher 
iage g efficiencies at temperatures in the 20° to 30°C degree range, however, so it is 
>c j Qr probably premature to generalize from the results with the well-studied F, I, and 
t0 N plasmids found in K coli and its relatives. Conjugational plasmid transfer seems 
Jther also to be restricted to pH values of 6 to 8.5 (55), requires actively growing cells 
irs j s (23-25), and is diminished in environments of lower nutritive value for the metabol- 
-j^ e izing bacteria. The probability for transfer of nonconjugative plasmids is indeed very 
jj 0St low because of the requirement that a cell harboring such a plasmid must first 
lsing acquire a conjugative plasmid that is able to mobilize the nonconjugative plasmid 
ra t0 for transfer to a suitable recipient. Frequencies of nonconjugative plasmid transfer 
. ce jj by such triparental matings may occur under ideal laboratory conditions at frequen- 
cy t0 cies ranging from 1(H per primary donor cell to an immeasurable quantity, depend- 
iy be m S on tne particular plasmids employed. Thus conjugational plasmid transfer in 
nature, although obviously occurring, is difficult to quantify at measurable rates or 
j ^ frequencies. 

,j! a ^ Transformation as a means of gene transfer is known to occur in streptococci, 

j uga _ bacilli, Haemophilus, and Neisseria, and in some instances this has been observed 

, ence to occur in animal hosts (33). Successful transformation of K coli with viral, 

iyceSt plasmid, and chromosomal DNA requires prolonged incubation at 0°C in the 

(j s m ' presence of CaCl 2 , and then a heat shock to facilitate DNA uptake (17, 21). Such 

t0 transformation occurs at very low frequencies, even under these unusual conditions 

, at j ve (which are probably not encountered in nature). It is thus unlikely that transforma- 

jf t h e tion would be a very significant contributing factor for gene flow in nature for most 

jj e Q f gram-negative microorganisms. 

pj ent When inheritance of genetic information transferred by virus infection, transfor- 
mer mation, transduction, or conjugation requires genetic recombination, the frequency 
•cipi. generally depends on similarities in guanine-cytosine contents and nucleotide se- 
ances quence homologies between donor and recipient DNA. In this regard, however, one 
t 75^ must also be aware of the prevalence of insertion sequences (40, 41, 65) and the 
lSm i(j newly discovered translocatable sequences (36) discovered in enteric microorgan- 
plas_ isms, which seem to permit, at low frequency, the movement of a DNA segment from 
one genetic element to another that is independent of any necessary nucleotide 
jnder sequence homology. Whether such sequences are prevalent in other organisms in 
lCt0 rs nature or are present on cloned DNA fragments remains to be seen. If so, such 
[ that sequences could facilitate the formation of unusual recombinants by using one or 
)f the more of tile natUf aHy occurring mechanisms for gene flow among microorganisms 
is are in 

10 8 to If l° wer eucaryotic microorganisms are employed as hosts for cloning recombi- 

t ra cts nant DNA molecules, it must be realized that many of these exhibit a classical sexual 

ional *yP e of reproduction as well as various types of parasexual gene transfer mecha- 

"ound nisms. In either event, the possibility exists not only for transfer of nuclear genetic 

: cells material but also for cytoplasmic genetic material. These systems, however, are 

, 55). generally restricted to genetic exchange only within a species and then only when 

tracts mating type factors are complementary. 
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SELECTIVE ADVANTAGES OR DISADVANTAGES CONFERRED BY FOREIGN 

DNA In addition to the effects cloned foreign DNA might have on the ecology of 
the chimeric microorganism and/or the transmissibility of cloning vectors, its pres- 
ence may alter the survival of cloning vectors or hosts that possess it. The long-term 
survival and perpetuation of an escaped recombinant DNA molecule necessitates 
that the foreign DNA confer either some selective advantage or at least no selective 
disadvantage to viral vectors or chimeric microorganisms that possess it. Probably 
the replication of the additional foreign DNA in the absence of any benefit to do 
so is a selective disadvantage. Similarly, the synthesis of gene products specified by 
the recombinant DNA, whether functional or non-functional, and which bring no 
benefit to the viral vector or host microorganism imposes an even greater burden. 
It is likely that in most instances the presence of cloned foreign DNA will be 
detrimental to the competitive survival of cells and vectors that possess it. 

The only available information on this point comes from a series of experiments 
performed by R. Davis (personal communication). He has taken random DNA 
segments from yeast, Klebsiella, and Dictyostelium, cloned them on a X vector, and 
then analyzed the frequency distributions of different sizes of foreign DNA segments 
as a function of the number of repeated X infection cycles. Although there was 
initially a large assortment of random fragments, after 30 infective cycles only one 
or two specific fragments gained ascendency in each case. When X vectors contain- 
ing these surviving fragments were studied in mixed infection with the X vector 
lacking a foreign DNA sequence, the X vector lacking the foreign DNA always 
became the predominant species. Thus in no case did a foreign DNA segment confer 
a selective advantage to the X cloning vector. 

Factors Affecting Probable Consequences of Escape and Survival 
of Recombinant DNA Molecules 

The overriding consideration concerns the degree with which the trait specified by 
the recombinant DNA is compatible with traits possessed by the host microorgan- 
ism or by other microorganisms that can act as recipients of genetic information 
from the host microbe, since these relate to the manifestation of a phenotype leading 
or not leading to the displacement of other organisms from their normal ecological 
niches. This overall consideration can be approached by first assessing whether or 
not the cloned DNA information is expressed in the primary host or in hosts that 
are capable of receiving genetic information from the primary host. As stated above, 
expression of eucaryotic genetic information in procaryotic organisms is not likely 
to be universally observed without additional genetic manipulation to permit faith- 
ful transcription and translation of eucaryotic genes. The reciprocal situation is 
equally true, but probably it is not a problem with regard to expression of eucaryotic 
genetic information in eucaryotic microbial hosts or of procaryotic genetic informa- 
tion in procaryotic microbial hosts. I anticipate, however, that difficulties in expres- 
sion of foreign DNA will ultimately be eliminated and thus in the following 
discussion I assume that foreign gene expression is likely to occur. 

With respect to the compatibility of an expressed foreign-DNA-specified trait in 
a microbial host or in other microorganisms with which the primary host can 
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exchange genetic information, there is a range of possibilities. For example, endow- 
ing K coli with the ability to synthesize keratin, if that were possible, would be 
unlikely to lead to hairy bacteria, whereas the endowment of E coli with the 
capability to carry out photosynthesis, because of transmission of such genetic 
information to other gram-negative soil-inhabiting microorganisms, might lead to 
the potential for a green earth. Also related to the expression of foreign genetic 
information is the ability of the primary host microorganism or of other microorgan- 
isms with which it can exchange genetic information to produce a gene product that 
can be released into the biosphere by occasional lysis of cells or by active secretion, 
in such a way that it has an adverse effect on the physiology, metabolism, or 
livelihood of other organisms that might come in contact with the substance. In 
these regards, then, we can look at competitive displacement of other organisms 
from their normal ecological niches as resulting from competition for essential 
foodstuffs, the production of products that are deleterious to other organisms, 
and/or the exhibition of traits of general pathogenicity, virulence, and invasiveness. 

In considering the potential for surviving recombinant DNA, whether in the 
original or in a secondary host, to cause displacement of other organisms from their 
normal ecological niches, one must also realize that differences in susceptibility of 
other organisms may depend upon their age. For example, newborn mammals have 
an immature intestinal mucosa that permits the transport of macromolecules; thus 
newborns could be susceptible to gene products produced by chimeric microorgan- 
isms that would not affect adults of the species. In a similar vein, one must also 
consider food chains in which an organism that might be displaced by chimeric 
microorganisms might initially appear to have no relevance to us, but which might 
play an important role as predator of another organism that, if unchecked, might 
become a serious pest or even a vector of an infectious disease agent active against 
other organisms in the biosphere. Similarly, the displaced organism might itself be 
prey and/or an essential foodstuff for another organism that might be of more direct 
importance to humankind. These types of subtle effects are difficult to predict and 
give cause for concern about the inadvertent escape and survival of recombinant 
DNA molecules. 

Estimating the Probability of Manifestation of Adverse Consequences 
by Recombinant DNA Molecules 

escape In its proposed guidelines for research involving recombinant DNA 
molecules, the National Institutes of Health has stipulated four categories of physi- 
cal containment, with their associated procedural precautions. These categories, 
designated PI, P2, P3, and P4, are to be employed commensurate with the potential 
risks associated with any given experiment, depending on whether these risks are 
minimal or nonexistent, low, moderate, or high. If all materials are sterilized or 
disinfected prior to disposal (a requirement for all four categories of containment), 
then this should all but eliminate the possibility that chimeric microorganisms or 
viral vectors containing foreign DNA could escape down the sink. If there is no 
mouth pipetting (advisable for PI containment and a requirement for the P2, P3, 
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and P4 containment categories), then accidental ingestion of viral vectors or hosts 
containing foreign DNA should also be a remote possibility. However, many com- 
mon laboratory procedures, as well as accidental spills, produce aerosols, and this 
production of aerosols is most likely to lead to contamination of investigators and 
the laboratory environment (28). Table 1 provides data on the doses of viruses or 
microorganisms that are likely to be inhaled because of the production of aerosols 
during several common laboratory operations performed by both proper and im- 
proper techniques. Such operations as inoculating cultures, pipetting, mixing (vor- 
texing), centrifugation, and so on also produce aerosols (28). It should be noted that 
the contamination of investigators and the laboratory environment due to aerosols 
make it all the more important that investigators wear appropriate protective cloth- 
ing which should not be worn outside the laboratory, refrain from eating, drinking, 
and smoking in the laboratory, since these practices lead to the ingestion and/or 
inhalation of aerosolized materials, wash their hands with disinfectant soap upon 
completion of procedures (disposable gloves which would be autoclaved are re- 
quired for some experiments using P3 and P4 containment), and disinfect the work 
area upon completion of procedures. 

It is thus apparent that, even when acceptable practices are used in the type of 
facilities required for a given experiment, some virus vectors or host organisms will 
escape from the laboratory environment. I estimate that the numbers of escaped 
viral vectors or host microorganisms should be reduced about 1000-fold for each 
increasing level of containment starting with PI. Estimating the numbers of virus 
vectors or hosts that might escape per day per investigator for a properly operated 
PI facility is difficult, but let me guess at a value of 10 5 . If we then assume 1000 
investigators working on 300 days per year, then 3 X 10 10 escape from PI facilities, 
3 X 10 7 from P2 facilities, and 3 X 10 4 from P3 facilities. These routes of escape are 
by contamination of the investigator (on clothing or by breathing or ingestion), by 
discharge through the ventilation system, by removal of floor sweepings, and by 
accidental disposal down the drain. 

Some members of the scientific community have argued that PI and P2 contain- 
ment afford no barriers to escape of chimeric microorganisms and therefore have 
suggested that P3 containment be the minimum requirement for recombinant DNA 
molecule research. They base their arguments, in part, on the observation that many 
microbiologists do not use such facilities and procedures properly, and they fear that 
many nonmicrobiologists who use such facilities and procedures will not initially 
know how to use them properly. Although I concede that this may be so and 
acknowledge that the estimates made above for the escape of virus vectors and 
microbial hosts would be increased considerably if the appropriate facilities and 
procedures are improperly used, I believe that one cannot legitimately redefine 
standard microbiological practices that are based on years of experience. For exam- 
ple, most clinical microbiology laboratories use PI or P2 facilities and procedures 
and P2 facilities are sufficient for research with Bacillus anthracis, Clostridium 
botulinum, Corynebacterium diphtheriae, Salmonella typhi. Shigella dysenteriae, 
Streptococcus pneumoniae, Vibrio cholera, Adenoviruses, Herpes virus hominus, 
Simian virus 40, measles virus, and so on (14). It thus would be inappropriate to 
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penalize all those who do know how to use PI and P2 facilities correctly and 
procedures and who wish to conduct low-risk experiments by requiring that expen- 
sive P3 conditions be the minimum. On the contrary, it should be the ethical 
responsibility of all investigators conducting recombinant DNA molecule research 
to ensure that their collaborators and support personnel follow the acceptable 
procedures for the type of experiment being performed. Peer pressure and review, 
assumption of liability for adverse consequences due to recombinant DNA molecule 
research by the principal investigator and the institution, and the possibility that the 
investigator and/or the institution might be deprived of government research sup- 
port should ensure this. 

SURVIVAL In assessing the probability of survival of recombinant DNA mole- 
cules in nature, given that accidental escape has occurred, we suffer from a paucity 
of factual information. What follows therefore represents my best judgments based 
on the available data and my own experiences. We must first consider the survival 
of the host or viral vector which carries foreign DNA and then the potential for 
survival of the recombinant information if it is transmitted to another organism in 
nature. Because most current studies are being conducted with R coli K12, I 
consider it as an example to estimate the probabilities for survival of recombinant 
DNA. In the guidelines for recombinant DNA molecule research being promul- 
gated by the National Institutes of Health (NIH), several levels of biological con- 
tainment for E. coli K12 host- vector systems are stipulated. The EK1 systems 
utilize existing E. coli K12 strains either with nonconjugative plasmid or X phage 
vectors and are supposed to afford a moderate level of containment. The EK2 
systems must be genetically modified to provide a high level of containment so that 
the probability of survival of cloned foreign DNA is 10~ 8 or less, either by survival 
of the host or by transmission and survival of the cloned DNA in some other host 
or vector. 

E. coli K12 was isolated from a human patient in 1922 but since that time has 
become adapted to the laboratory environment (6). Although it will survive passage 
through the intestinal tracts of humans, calves, rats, and mice, it has failed to 
colonize these species in a limited number of studies (3,4,76; S. Falkow, personal 
communication; R. Curtiss and co-workers, unpublished). These experiments, how- 
ever, are of limited value in reaching any valid generalizations because the factors 
leading to colonization of the intestinal tract are poorly understood. For example, 
R. coli strains isolated from one strain of mice are generally unable to colonize other 
strains of mice or other rodent species but pass through the intestinal tract in much 
the same way as does R coli K12. Thus there seems to be some type of mammalian 
host species or strain specificity that plays a role in colonization. The method of 
prior growth of the E. coli strain also influences the degree of colonization or 
persistence of the strain in the intestinal flora with anaerobic growth leading to 
longer persistence than aerobic growth. In the studies with feeding of R coli K12 
to humans (3,4, 76), variables such as age and natural diet of the individuals, which 
should also influence colonization, were not systematically studied. R coli K12 cells 
surviving passage through the intestinal tract remain viable in feces for several days 
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at least, but little or no information is available on the survival of this strain in 
sewage or during passage through sewage treatment plants. The plentifulness of 
E. coli strains in sewers, rivers, etc, leads one to believe that this species not only 
survives for some period of time in these environments, but must also undergo active 
multiplication. Certainly, unpolluted waters receiving vigorous aeration are not 
favorable for survival and multiplication of E. coli or other enteric microorganisms. 
Thus the quality of receiving waters, which is generally low throughout the world, 
is a major factor in the ultimate survival of inadvertently released, chimeric E. coli 
strains. E coli strains are found in the pharynx and oral cavity of healthy individuals 
on an intermittent basis (Curtiss et al, unpublished), but little is known relative to 
the likelihood of E. coli K12's survival in these ecological niches, or for that matter, 
its survival on the skin. There is also little information about the survival of E. coli 
K12 during conditions of drying following a laboratory spill, although we have 
found a 50-90% death rate per day with E. coli cells dried under sterile conditions 
in a growth medium. 

Based on these considerations, I estimate that escaped EKl-type E coli K12 cells 
would have a mean probability of survival of about 10~ ! per day, although one must 
consider that multiplication might occur in some favorable environments such as 
a sewage treatment plant. By using an EK2 E. coli K 1 2 host whose survival is solely 
dependent upon a uniquely denned laboratory environment that is unlikely to be 
encountered in nature, we have found that the probability of survival is 10~ 8 or less 
after one day (or less) under conditions that are favorable for growth of normal 
E. coli cells and is about 10 -1 per day under conditions that do not permit growth 
of normal R coli (i.e. in water, physiological saline, after drying, etc). 

In terms of transmission of genetic information from E. coli K12 strains in nature, 
we can consider in decreasing order of likelihood conjugation, transduction, and 
transformation. E. coli K12 is known to be able to receive or donate plasmid DNA 
with representative strains from over 20 bacterial genera under optimal laboratory 
controlled conditions (31, 46, 55, 56, 68). Nevertheless, the frequencies of transmis- 
sion of plasmids in certain plasmid incompatibility groups borders on the level of 
undetectability, even under these optimal conditions. Transmission of some plas- 
mids, however, is known to occur in the intestinal tracts of various mammalian and 
avian species that usually possess an artificial flora because they began as gnotobiotic 
animals or were treated with antibiotics prior to contamination (4, 31). In normal 
individuals, the frequency of conjugative plasmid transmission is all but undetecta- 
ble (3, 4, 31). I would therefore estimate that if an F or I group conjugative plasmid 
(repressed for expression of the donor phenotype) were used to clone DNA in 
wild-type R coli K12 cells, the probability for transferring the chimeric plasmid to 
another microorganism would be no more than 10" 8 per surviving escaped cell. (This 
estimate is based on the maximum observable frequency of 10 -3 per donor cell for 
F or I group repressed conjugative plasmid transfer under ideal laboratory condi- 
tions, which is decreased a further 100-fold because the titers of suitable recipients 
in nature are about lOVml rather than lOVml, and another 1000-fold because of 
other barriers to plasmid transmission such as restriction, entry exclusion, incom- 
patibility, cell surface differences that block union formation, and/or inability of 
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recipient cells to establish and/or maintain the donated plasmid.)*This probability 
is reasonably valid for escaped E. coli K12 chimeric hosts that are either ingested 
or disposed of down the drain where they will ultimately end up in a sewage 
treatment plant, but is unreasonably high for chimeric hosts that escape by aerosoli- 
zation and end up in a dried state that is not conducive to either finding a suitable 
partner or conjugation. This probability would also be further decreased if the 
chimeric conjugative plasmid is in a disabled EK2 host, since a metabolically 
inactive donor very rapidly loses its ability to conjugate (24). If a nonconjugative 
plasmid such as pSClOl is employed as the cloning vector, the chimeric microbe 
must first acquire a conjugative plasmid, which might occur at a probability of no 
more than 10~ 7 per surviving chimeric host cell. (The slightly higher probability for 
acquiring a conjugative plasmid is because host strains used for recombinant DNA 
molecule research are generally restrictionless, thus eliminating this barrier.) Since 
this rare chimeric host cell that now possesses a conjugative plasmid would have 
a probability of no more than 10" 8 per surviving cell of conjugational transmission 
with some other microorganism and since the probability of pSClOl mobilization 
by most conjugative plasmids is about 10" 5 to 10" 7 per successful conjugational 
event, the overall probability for transfer of a chimeric pSClOl plasmid would 
probably not exceed 10" 20 per surviving cell. If the E coli K12 host used has been 
genetically manipulated to reduce or eliminate its ability to accept conjugative 
plasmids, then the probability for conjugational transmission of a chimeric noncon- 
jugative plasmid can be essentially eliminated. This same result can also be achieved 
if the nonconjugative plasmid either depends for its replication on its host strain or 
possesses a killer function that leads to the death of other microorganisms into 
which it is transferred. 

The ubiquity of conjugative and nonconjugative R plasmids and other plasmid 
types specifying surface antigens, colicins, enterotoxins, and so on would suggest, 
however, that although the frequencies of conjugational plasmid transmission are 
generally unmeasurable (i.e. occur at frequencies lower than 1CH), they do occur in 
nature. In this regard, one must remember the immense numbers of bacteria that 
occupy a given sewage treatment plant or a given polluted river, so that probabilities 
of occurrence as low as 10~ 20 may not be all that infrequent. Indeed, the total number 
of E coli cells within the intestinal tracts of the world's entire human population 
is very nearly equal to 10 20 . 

The probabilities for transmission of chimeric plasmid DNA by transduction to 
other microorganisms is difficult to estimate. Specialized transducing phages are not 
likely to be a problem since the probability for integration of a prophage into a 
plasmid vector is likely to be 1(H or lower, and then upon spontaneous induction 
(occurring with a probability of about lCr 4 ) only one in 10 6 phage might possess a 
recombinant DNA fragment. Thus, the probability for getting a specialized trans- 
ducing phage carrying recombinant DNA would already be 10" 16 per surviving 
chimeric E. coli K12 cell even before estimating the probability that this surviving 
cell will encounter a suitable phage in nature and then the probability that the 
transducing phage produced would survive long enough to encounter a robust 
sensitive E coli cell to infect and lysogenize. Generalized transducing phages such 
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as PI (and its relatives), Mu, D108, and Tl could, upon infection of a surviving 
K coli K12 cell, pick up the entire chimeric plasmid with probabilities ranging from 
10 -5 to 10 -9 , depending on the particular phage. Again we must also consider the 
probabilities of the surviving chimeric K coli K12 cell being infected by such phages 
and of the transducing phage encountering a sensitive strain that can be transduced. 
It should also be mentioned that transduction by any of these phages will also be 
decreased 10- to 1000-fold by the probable occurrence of restriction by the infected 
sensitive host. Transductional transmission of recombinant DNA can be made an 
even less likely event by using an EK2 host strain that is resistant to common 
E, coli transducing phages and that is, outside of its carefully controlled laboratory 
environment, metabolically defective in terms of permitting productive phage infec- 
tion. 

The probability for transmission of chimeric plasmid DNA by transformation 
should be dependent on the lysis of the E coli K12 host, the survival of the DNA 
in the absence of degradation and the likelihood that his DNA could be taken up 
by and survive restriction in suitable microorganisms in nature. Since transforma- 
tion of gram-negative microbes requires incubation of cells at 0°C for several hours 
in the presence of CaCl 2 , followed by a heat shock (17) — an unlikely set of condi- 
tions to find in nature — and since even this treatment only results in a 10 -6 probabil- 
ity of transformation in nonrestricting hosts, it seems most improbable that 
recombinant DNA could be transmitted in this manner. The use of an EK2 E. coli 
host that undergoes damage to its DNA outside of its carefully controlled laboratory 
environment would further minimize the probability for transmission by transfor- 
mation in nature. 

In terms of viral cloning vectors, most experiments now being done are utilizing 
bacteriophage X of E. coli K12. Most X cloning vectors now in use are defective 
in lysogenization and plasmid formation [each type of occurrence has a probability 
of 10 -6 or less (50, 53, 71, 72)], and because of this one must consider survival of 
the propagating host strain, which could be either a typical EK1 laboratory strain 
or an EK2 strain that depends on carefully controlled laboratory conditions for its 
growth and survival. In the former case, the probability of survival per day would 
be about 10 -1 , and in the latter case between 1CH and 1CH, depending on the absence 
or presence of conditions permitting rapid death of the EK2 host. We would thus 
have combined probabilities during the first day for formation and survival of X 
cloning vectors that contain foreign DNA as either a plasmid or prophage of 
between 10" 7 and I0~ 14 , although the EK2 host could not survive in nature and 
would have to transmit the foreign DNA to some other robust bacterial strain. The 
use of chloroform, which is customary when making phage lysates (1), would all 
but eliminate these routes of escape and survival, however. The probability of 
survival of foreign DNA carried on escaped X particles depends on the probabilities 
of X survival and its encountering a suitable sensitive host in nature. Wild-type X 
is relatively sensitive to dessication and to the acidity of the stomach (W. Szybalski, 
personal communication) and X-sensitive E coli strains are relatively rare in nature 
(R. Davis, personal communication), probably no more frequent than one per 1000. 
Thus the probability of an escape X surviving and encountering such a host might 
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be between 10"' and 10~ 12 during the first day after escape. Because most such hosts 
would restrict A, this decreases these probabilities another 10- to 1000-fold. If the 
X cloning vector is dependent on its original host for its propagation, however, and 
also is highly virulent and unable to establish lysogeny, then the probability for 
transmission of carried foreign DNA depends on the likelihood of an encountered 
X-sensitive R coli possessing a lambdoid phage as a prophage and of recombination 
yielding a wild-type lambdoid phage possessing the foreign DNA. Even though 
lambdoid phages are reasonably common in wild-type E. coli strains (44, 64), this 
required sequence of events should reduce the overall probability of perpetuation 
of the foreign DNA cloned on X to less than 10" 20 . 

Another factor that reduces the probability of recombinant DNA survival is the 
recent finding that foreign DNA sequences cloned on bacteriophage X are lost in 
competition with wild-type X during multiple replication cycles (R. Davis, personal 
communication). Of course, certain cloned DNA fragments may either increase, or 
at least not adversely affect, the survival potential of cells or vectors that possess 
them, but these instances seem to be rare (P « 10~ 3 ) compared to the total types 
of cloned DNA fragments that arise from a shotgun type of experiment in which 
the total genome of the donor is fragmented and introduced randomly into the 
cloning vector. 

adverse CONSEQUENCES In considering adverse consequences due to expres- 
sion of foreign DNA, I assume that expression can be achieved. If one then starts 
with random fragments of a foreign genome, I estimate that no more than one in 
1000 such fragments possess genetic information that would be deleterious to some 
other organism in the biosphere if expressed by a chimeric microorganism in the 
same ecological niche. One could also reduce this probability for expression of 
adverse consequences another 10-fold at least, because the deleterious gene product 
is unlikely to be secreted or excreted in a biologically active form or then to be taken 
up by the target organism where it can exert its adverse effect. Thus the overall 
probability that an escaped surviving random foreign DNA segment would manifest 
expression of adverse consequences for some organism in the biosphere is lfr 4 or 
less. It should be obvious, however, that this value should be modified depending 
on the source, purity, and known potential for expression of harmless vs harmful 
gene products of the cloned foreign DNA. 

One other cautionary comment should be raised about the potential for manifesta- 
tion of adverse consequences associated with recombinant DNA molecule research. 
Namely, will any of the supposedly beneficial applications of the technology create 
a biohazardous condition that would be more detrimental than the original problem 
being corrected? A meaningful answer to this question is probably premature, but 
it would certainly be worthwhile to give careful thought to this issue before advanc- 
ing too far in using recombinant DNA molecule techniques to solve the myriad of 
problems now confronting our society. 

SUMMATION In the foregoing treatment, I attempt to estimate all factors that 
would or could contribute to the display of a biohazardous situation by using, as 
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examples, the cloning of random DNA sequences of a eucaryotic organism with 
K coli K12 hosts and vectors. For EK1 plasmid-host systems, the limiting factor 
is the probability of survival of the host, and thus one can discount the lower 
probabilities for transmission of cloned foreign DNA in estimating the likelihood 
of biohazardous situations. With X vector EK1 systems, however, one should con- 
sider both the survival of the host if lysogenization and/or plasmid formation can 
occur, as well as the survival and perpetuation of the X vector. For EK2 plasmid- 
host systems in which the host theoretically cannot survive in nature, the probability 
of perpetuation of cloned foreign DNA is the product of the host survival and the 
transmissibility probabilities. The use of EK2 hosts with EK2 phage or plasmid 
vectors that either are dependent on the EK2 host or that kill other microorganisms 
into which they are introduced all but eliminate the probability for survival and 
perpetuation of the cloned DNA. In these instances the probabilities need to be 
multiplied times the total number of escaped chimeric microorganisms or viral 
vectors times the probability that the cloned DNA would not confer a selective 
disadvantage on hosts or viral vectors times the probability for displaying adverse 
consequences. 

Although these total estimated probabilities for an occurrence of a biohazard are 
exceedingly small, it should be reiterated that noncompliance with use of standard 
required practices for physical and biological containment makes these low esti- 
mates meaningless. For example, contamination of an EK2 host culture at the time 
of cloning foreign DNA could yield robust chimeric microorganisms with excellent 
chances for survival and perpetuation of cloned DNA. 

The same considerations for estimating biohazard probabilities can be made for 
cloning foreign DNA in other microorganisms such as Bacillus, Pseudomonas, 
yeast, and others. The favored routes for escape and ecological niches for survival 
and the organisms at risk in the biosphere are different than for K coli, but the 
ultimate long-term effects would be similar. 

INTENTIONAL RELEASE OF GENETICALLY 
ALTERED MICROORGANISMS 

For Beneficial Purposes 

Some foreseeable applications of the ability to clone foreign DNA in bacteria include 
use of these chimeric microbes for beneficial purposes to clean up environmental 
pollutants, to eliminate specific insect pests, or even to produce antibodies specific 
against toxic microbial products, such as the cholera enterotoxin as a means to cope 
with cholera infections. These beneficial purposes may be laudatory, although I 
consider such release any time in the near future to be potentially quite hazardous 
and therefore totally inadvisable. For one thing, a substantial amount of information 
would be needed to verify that the chimeric microorganism exhibited all the desired 
attributes and none of the potential undesirable attributes. Furthermore, the release 
of such a microorganism in one country is a de facto release in all other countries, 
irrespective of whether such other countries approve or not Therefore, some inter- 
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national authority needs to be established to govern or regulate such beneficial uses 
of chimeric microbes. 

For Nonbeneficial Purposes 

Although reprehensible and agonizing to contemplate, the technologies for con- 
structing chimeric microorganisms could be utilized to develop biological weapons. 
Despite the recent signing of treaties to destroy all biological weapons and cease 
further development of such agents, recent events have undermined the faith of the 
individual in the credibility of the governments of the world. One must therefore 
exhibit some degree of apprehension about the potential misuse of these technolo- 
gies. I would hope that anyone or any government contemplating such use would 
recognize that the consequences would be just as adverse for them as for any other 
group of individuals. 

COPING WITH POTENTIAL BIOHAZARDS 

Shortly after the publication of the Berg et al (18) letter calling attention to the 
potential biohazards associated with recombinant DNA molecule research, I began 
to consider the possibility that one might genetically alter E. coli as a host for 
recombinant DNA research to make it safer, a goal in keeping with our then-current 
intent to make E. coli more useful for recombinant DNA molecule research. This 
idea was communicated to a group of microbial geneticists and plasmid researchers 
who had been asked to draft a proposal for the conduct of recombinant DNA 
molecule experiments with procaryotic organisms for submission to the Asilomar 
Conference on Recombinant DNA Molecules. These concepts were discussed, ex- 
panded, and refined by this group and presented at this meeting. The concept of 
genetic manipulation of vectors and hosts to provide biological containment for 
recombinant DNA molecule research was persuasively advocated by S. Brenner at 
the Asilomar meeting and resulted in the adoption of this added means to contend 
with potential biohazards (9). 

Stated simply, as it would apply for the use of any microbial system for recombi- 
nant DNA molecule research, the host strain should be uniquely adapted to a 
carefully controlled laboratory environment so that it can neither survive nor trans- 
mit genetic information to other organisms in any environment in which it might 
naturally find itself. This obviously means that the host strain should not be able 
to survive in its normal ecological niches if these should be present in the laboratory 
setting or connected or adjacent to it; for example, intestinal microorganisms should 
not be able to colonize or survive in the intestine, organisms that normally reside 
on the skin should not be able to survive on the skin, organisms that normally live 
in soil or on plants should not be able to live in soil or on plants, and so on. In terms 
of transmissibility of genetic information, it is best to have a plasmid or viral cloning 
vector that is solely dependent for its replication on the genetically altered host and 
even to endow it with the potential to kill any other microorganism into which such 
DNA might be introduced. In so genetically modifying vectors and hosts, one 
should employ deletion mutations whenever possible and even double mutations 
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with the mutant alleles at widely spaced positions on the genome to diminish the 
likelihood of reversion of the trait or loss by genetic exchange and/or recombination 
with other organisms or vectors in nature. 

In the year since the Asilomar Conference, our laboratory group has endeavored 
to construct a number of E coli K12 strains that would provide these fail-safe 
features. At the commencement of these endeavors, the task seemed straightfor- 
ward; however, as we soon learned, appropriate deletion mutations did not exist and 
K coli had a will as well as a means to survive under adverse conditions when it 
should have dropped dead. It has thus required a total of 13 genetic manipulations 
to obtain a strain that should meet the requirements for an EK2 host as specified 
in the current proposed NIH Guidelines for Recombinant DNA Molecule Re- 
search. This strain cannot synthesize the murein layer of its cell wall, except under 
carefully controlled laboratory conditions because it has two mutations at different 
map positions; is unable to synthesize the mucopolysaccharide colanic acid that can 
facilitate the ability of E coli to survive in the absence of the murein layer of the 
cell wall; is sensitive to bile salts and detergents, undergoes thymineless death, and 
degrades its DNA in environments lacking thymine or thymidine; is resistant to a 
number of E. co//-transducing phages; is very sensitive to UV irradiation; is unable 
to act as a recipient in matings with donors possessing conjugative plasmids in five 
of the groups commonly or occasionally found in enteric microorganisms; and, 
finally, is defective as a recipient in matings with donors possessing other conjugative 
plasmid types. This strain still has a finite low but unmeasurable probability for 
transmitting recombinant DNA molecules to other organisms in nature, and we are 
therefore continuing our studies to design a completely fail-safe bacterial host — if 
that is at all possible. 

Our recent experience in this endeavor leads me to believe that it would be 
possible to genetically alter other microorganisms and also cloning vectors in such 
a way as to all but eliminate any and all of the potential biohazards that might be 
associated with recombinant DNA molecule research. This task would not be easy, 
however, and would require considerably greater amounts of information than are 
currently available on the genetics, metabolic capabilities, and ecological potentials 
of microorganisms that might be deemed worthy for use as hosts for recombinant 
DNA molecule research. Until such time, it is likely that E. coli K12 will continue 
to be the workhorse of modern molecular genetics. It should be readily admitted 
that many have criticized the use of E. coli for this research because of its intimate 
association with humans, its known potential for exchange of genetic information 
with strains of other gram-negative genera, and the potential of other non-laborato- 
ry-adapted strains of E coli to exhibit considerable virulence, pathogenicity, and 
communicability. Although I agree that these known attributes of E, coli make it 
less than totally acceptable for use in these experiments, these problems are well 
known and the means to effectively eliminate these problems have now been devel- 
oped. To choose another microbial host about which less is known under the 
expectation that such similar problems do not exist seems irresponsible to me. It is 
far better to know a sufficient amount of information in order to recognize and then 
eliminate all problems than to assume they do not exist and proceed on that basis. 
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DATA NEEDED TO EVALUATE POTENTIAL BIOHAZARDS 
OF CHIMERIC MICROORGANISMS 

Although information will eventually be learned during recombinant DNA mole- 
cule research that will permit assessments of potential biohazards, I believe that a 
more dedicated effort needs to be made by scientists engaged in this research to 
obtain as much of this information as soon as possible after construction of chimeric 
microorganisms or virus vectors. For example, experiments to determine whether 
the cloned DNA affects the survival of the host under permissive and nonpermissive 
conditions or to determine the transmissibility of the plasmid vector should have 
a high priority. Competition experiments can also be performed to determine 
whether a given recombinant DNA molecule causes a cell to grow slower or faster 
than a host just carrying the cloning vector or whether it causes a cell to display 
a selective advantage or disadvantage over other cells of the same or different species 
that lack the recombinant DNA. Studies can also be performed to determine 
whether the recombinant DNA is expressed and if so whether the products synthe- 
sized are deleterious to any organism that might share or provide an ecological niche 
for the chimeric microorganism or other organisms with which it can potentially 
exchange genetic information. Similar experiments can also be performed for viral 
vectors containing foreign DNA. In all these types of studies, one must consider the 
source of foreign DNA, the traits potentially specified, and the properties of the host 
microorganism or viral vector in evaluating which organisms might be at potential 
risk and which therefore should be studied for adverse effects caused by the host 
microorganism or viral vectors carrying foreign DNA. In association with measure- 
ments as to whether the presence of the cloned DNA alters the frequency with 
which the recombinant DNA can be transmitted to other microorganisms, it would 
also be desirable to gradually obtain additional data on the prevalence of microor- 
ganisms in nature which display sensitivity to viruses being used as cloning vectors, 
which possess nonsense suppressors (since these might permit replication of cloning 
vectors that are dependent on such suppressors), and which lack restriction systems. 

CONCLUDING REMARKS 

Recombinant DNA molecule research will undoubtedly contribute information that 
will revolutionize biological science and ultimately society in the years to come. The 
conduct of this research is associated with numerous potential biohazards that are 
difficult, if not impossible, to assess totally at this time because of a lack of needed 
information. Nevertheless, the application of established procedures and physical 
containment facilities for the conduct of this research, along with the use of dis- 
armed hosts and cloning vectors that are unable to permit the survival and perpetua- 
tion of cloned DNA sequences, should all but eliminate any adverse consequences 
of this research. In view of the lack of information about the basic ecology of 
microorganisms and their interactions with other species and about the deleterious 
attributes that might be associated with recombinant DNA, it is urged that scientists 
engaged in this research endeavor to learn as much about these potential biohazards 



Board Assigned Page #2000 



I 



JAN 29 2007 17:24 FR CISTI ICIST 

1 



613 952 9303 TO 12122308888 P. 27/2 



GENETIC MANIPULATION OF MICROORGANISMS 53 1 



as is possible, so that future assessments of hazards can be far less speculative. 
Although it now seems likely that members of the scientific community who plan 
to do this research have accepted a set of guidelines that maximize .benefits and 
minimize hazards, it is far less certain that similar guidelines have been or will be 
established to guide the commercial applications of this technology or the misuse 
of the technology for the preparation of military weapons. For these reasons I 
remain apprehensive. 
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I. Introduction 

Synthetic oligonucleotides have played a significant role in our 
understandmg of many current problems in molecular biology. Syn- 
hetac homopolymerxc oligodeoxynucleotides have been extensively 
used for studying the mechanism of action of various enzymes on DNA 
synthesis in vitrc >(1). Synthesis of all of the possible 64 rrhibonu 
cleoside diphosphates have been accomplished for the eluc^tiTof 
the genets code (2, 3). More recently, several segments of DNA pos 
sessing important biological functions have bee/chemically syX 

s pettiS r ne 7 6 l e two T nsfe i rna genes (4 > S >- an ****** 

opemtor DNA If ™f 0 e ™* ^onnone gene (7), several lactose 
operator DNA segments (8-13), and several restriction endonuclease 
recogn ltl on sequences (11, 14-16). The availability of these synthet" 

Teat r d S el S ^ *° undfiratand these 

greater depth. Short segments of oligodeoxynucleotides have also 

been synthesized as DNA primers for sequence analysis (17 18) 

In th 1S review we present a general picture of how 'synthetic 

oligonucleotides have been used to solve various pTobTems in 

n^:i: h ^:^ k r the , length «** ^ 

T , 7 c7l co ? fined ourselves to the area of oligodeoxvribonu 
c eotides of defined sequences, a field closer to our o^s^Z^Z 

1977 Wrar 111 " SU ^ ed indudeS arHcleS Polished Sore C 
1977 Several review articles covering chemical synthesis (18-21) and 
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II. Chemical Synthesis of Oligodeoxy ribonucleotides 

The synthesis of oligonucleotides involves the joining of nu- 
cleotides through phosphodiester linkages. During synthesis, internu- 
cleotidic bonds may be maintained either as phosphodiesters or phos- 
photriesters (Fig. 1). Both methods have been used extensively for the 
synthesis of oligonucleotides with defined sequences. 

Synthesis of a dinucleotide containing a (3' -» 5') internucleotide 
linkage was achieved for the first time in 1955 by condensing 3'-0- 
acetyl thymidine with 5'-0-acetylthymidine 3'-benzylphosphochlori- 
date (28). However, various difficulties were encountered in the pre- 
paration and purification of the neutral intermediate containing the 
phosphotriester functions. In 1958, Khorana et al. (29) achieved the 
synthesis of thymidylyl-(3' -» 5')-thymidine in high yield by intro- 
ducing a new condensing agent, dicyclohexylcarbodiimide (DCC) for 
the condensation of 3 '-O -acetyl thy mi dine 5'-phosphate with 5'-0- 
tritylthymidine. 



A. Phosphodiester Method 

1. Basic Approach 

The main feature of this approach is outlined in Fig. 2. DCC was 
the first condensing agent used. However, for the synthesis of longer 
chains, mesitylenesulfonyl chloride (30) and triisopropylbenzenesul- 
fonyl chloride (31) were introduced. In the synthesis of an oligonu- 
cleotide containing bases other than thymidine, it became important 
that functional groups such as the free hydroxyls of sugar moieties and 
the amino groups of adenine, cytosine and guanine be protected. To 
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FIG. 1. A scheme showing three methods for the chemical synthesis of 
oligodeoxynucleotides. 
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FIG. 2. The phosphodiester method of oligonucleotide synthesis. The synthesis of a 
trinucleotide is shown. 

protect the 5'-hydroxyl group of a deoxynucleoside, p-monomethoxy- 
or dimethoxytrityl groups were used. To protect the amino group of the 
heterocyclic bases, acylation of the deoxynucleoside followed by 
selective hydrolysis was employed (32). Using this approach, the syn- 
thesis of oligodeoxynucleotides containing homo- or repeating mixed 
polymers was achieved using chemical polymerization (33), whereas 
the synthesis of denned sequences of oligonucleotides was ac- 
complished by stepwise block condensation (34). Variations of the 
phosphodiester approach have been employed for the synthesis of 
several oligodeoxynucleotides of biological significance. These in- 
clude sequences corresponding to DNA coding for a derivative of the 
S-peptide of ribonuclease A (6), the hormone angiotensin 11(7), frag- 
ments containing natural sequences of the phage T4-gene (35^38), and 
the X-endolysin gene (39); and, most notably, all the fragments for the 
alanine tRNA gene, (4) and for the tyrosine tRNA gene and its con- 
trol regions (5, 40). 
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Although the phosphodiester method is useful and reliable, it is 
inherently laborious and gives low yields of the longer oligodeoxynu- 
cleotides for the following reasons. The synthetic intermediates are 
ionic diesters, insoluble in most organic solvents, and the common 
method of isolation involves prolonged chromatography on DEAE- 
cellulose column. A more serious problem is caused by the presence in 
intermediate fragments of the nucleophilic phosphodiester function, 
which becomes activated by the condensing reagents (such as arylsul- 
fonylchloride) and causes chain breakage (41 ). In order to obtain rea- 
sonable yields, excess of the incoming nucleotide block must be used, 
making the diester method inefficient. 

Several recent improvements have been made in the diester 
method to facilitate the isolation of the product. These include aro- 
matic protecting groups to exploit affinity columns (42), solvent extrac- 
tion (43), "high-pressure" liquid chromatography (44), and reverse- 
phase "high-pressure" liquid chromatography (45). 

2. Polymer-Support Synthesis 

In spite of the great success of the polymer- support method in 
peptide synthesis, little related progress has been reported in 
oligonucleotide synthesis. This is because of the decreased yield at the 
condensation step with increasing chain length. Recently, the synthe- 
sis of a heptanucleotide has been achieved (46) in modest yields by 
using a cross-linked polydimethylacrylamide resin. This resin con- 
tains the amino group of /3-alanine residues present as butoxycarbonyl 
derivatives, which serve as anchoring points. Next, it is treated with 
jS-hydroxythioether, allowing the substituted resin to act as a phos- 
phate protecting group. The chain extension reactions follow a phos- 
phodiester approach by reaction of the support with pyridinium 
3'-0-acetylnucleoside 5'-phosphates preactivated with isopropyl 
benzensulfonyl chloride. This method has undergone further im- 
provements by using a modified Beckman 990 Peptide Synthesizer, 
resulting in a cycle time of 24 hours per nucleotide addition. Two 
nonadeoxyribonucleotides, d(pT-T-C-T-G-T-T-G-A) and d(pG-G-A- 
G-G-A-G-A-A), required as primers for sequencing mouse immuno- 
globin mRNA, have been prepared by this approach (Gait and Shep- 
pard, personal communication). 

B. Phosphotriester Method 

1. Earlier Methods 

The synthesis of the first dinucleotide was achieved by the triester 
method by Michelson and Todd (28) in 1955, although the yield was 



Board Assigned Page #2009 



29 2007 17:30 FR CISTI ICIST 



613 952 9303 TO 12122308888 



P. 08/43 



106 



R. WU, C. P. BAHL, AND S. A. NARANG 



poor. In 1967, the phosphotriester method was used by Letsinger and 
Ogilvie (48, 49) for the synthesis of a hexathymidylate in a reasonable 
yield. This' method has been extensively studied in various labora- 
tories (50-54). However, no biologically important oligodeoxynu- 
cleotides had been synthesized by this method until 1973, when impor- 
tant modifications were introduced (55). Since then, the modified t 
phosphotriester method has become a more convenient method for 
synthesis of oligonucleotides of defined sequences. ^ 1 

The triester method involves phosphorylation of the 3'-hydroxy 
group of a 5'-protected mononucleoside with a phosphate 
monoesterified by some group such as /3-cyanoethyl, o-chloro, or 
p-chlorophenyl, followed by subsequent condensation with the pri- 
mary 5'-hydroxyl group of a suitably 3'-protected nucleoside. Since 
each internucleotide bond in the product is present as a triester func- 
tion, isolation and purification by the more conventional techniques of ■ 
organic solvent extraction and silica-gel chromatography became pos- ; 
sible. However, it was observed that, owing to incomplete phosphoryl- 
ation in the first stage, subsequent coupling with 5'-protected nu- 
cleoside led to a complicated reaction mixture. Since these mixtures 
could not be completely resolved on conventional silica-gel columns, 
the advantages of large-scale synthesis and high yields of product 
were somewhat nullified. 

2. Modified Phosphotriester Method 

To overcome the above difficulty, Narang ef al. modified (55) the 
"one-pot" triester approach of phosphorylation and coupling (48) to a 
"two-step" sequential coupling procedure. The basic feature of this 
procedure is the starting of synthesis of an oligodeoxynucleotide from 
fully protected mononucleosides containing a fully masked 3'- 
phosphate group (Fig. 3). Since the resulting intermediate oligonu- 
cleotides synthesized contain a fully masked 3 '-phosphate group, the 
necessity for phosphorylation at each condensation stage is eliminated. 
A similar modification has been employed by Cramer et al. (56). 

Phosphorylation. The initial phosphorylation of the 3'-hydroxy component was car- 
ried out with te(triazoyl)-p-chlorophenyl phosphate (57) followed by the addition 
of /i-cyanoethanol. 

Coupling Reagents. For the triester method, triisopropylbenzenesulfonyl chloride 
was the condensing reagent initially used (59). The introduction of arylsulfonyls 
activated by triazole (8, 58), nitrotriazole (Narang and Stawinsky, unpublished), and 
tetrazole (60), resulted in higher yields with a minimum of side products. The 
corresponding imidazolides (61) and nitroimidazolides (62) for the synthesis of 
oligonucleotides have also been used. 

Synthesis of Oligodeoxynucleotides. A 5'-dimethoxytrityl monodeoxynucleoside 



Board Assigned Page #2010 



JAN 29 2007 17:31 FR CISTI ICIST 



613 952 9303 TO 12122308888 P. 09/43 



NARANG 

lger and 
isonable 
; labora- 
.eoxynu- 
n impor- 
nodifled 
■thod for 

•hydroxy 
losphate 
iloro, or 
the pri- 
le. Since 
iter func- 
liques of 
ame pos- 
□sphoryl- 
;cted nu- 
mixtures 
columns, 
f product 



d (55) the 
I (48) to a 
re of this 
)tide from 
asked 3'- 
; oligonu- 
*roup, the 
Liminated. 
(56). 

ient was car- 
the addition 

nyl chloride 
irylsulfonyls 
jlished), and 
•oducts. The 
synthesis of 

:ynucleoside 




(I) .Pr 3 PhSO^-lst'rojole 

Ssltca -gel chromotograpEiy 
{in) 60% ocet.c ocrd 

(IV) 0.21V NoOH-dit»ona 

(V) cone Nri^ 
(VI! Sephodex 
WTLC on cellulose 

Fig. 3. The modified phosphotriester method of oligodeoxynucleotide synthesis. 
Triisopropylbenzenesulfonyltetrazole (iPr 3 PhS0 3 -tetrazole) is the condensing agent. 

3'-cyanoethyl p-chlorophenyl phosphate is the starting material for oligonucleotide 
synthesis in the triester method. It was essential to develop conditions for the 
selective removal of the acid-labile group without depurination and for removal of 
the £-cyanoethyl group without affecting the internucleotide phosphate protecting 
group, i.e., the p-chlorophenyl group. In the first case, we found that benzenesul- 
fonic acid in chloroform (11) selectively removed monomethoxy- and dimethoxy- 
trityl groups at 0°C with a minimum of depurination. Mild treatment with sodium 
hydroxide/dioxane/water was found to be satisfactory for removal of the 
0-cyanoethyl groups only in smaller oligonucleotides, while treatment with 
anhydrous triethylamine pyridine (63) was found to be highly effective for longer 
oligonucleotides. A similar observation has been reported independently (64) for 
the selective removal of /3-cyanoethyl groups from fully protected 
monoribonucleotides. After each condensation step, the reaction mixture was chro- 
matographed on a silica-gel column. After complete deblocking under milder con- 
ditions (11), the isolation of the desired product was carried out on a Sephadex 
gel-filtration column followed by polyethyleneimine-cellulose thin-layer chroma- 
tography (65). 

3. Phosphite-Triester Method 

The key feature of this new approach (66) is the coupling of nu- 
cleosides by reaction with a phosphodichlorite (a trivalent phosphorus 
reagent). At the end of the synthesis, phosphite intermediates are 
oxidized to phosphotriesters (Fig. 1, C). The potential advantages of 
this approach are: the short time required for the completion of the 
reaction, the low temperature for the condensation reaction, the rela- 
tively high selectivity of phosphochlorite reagents, and favorable 
yields. 




i 



Board Assigned Page #201 1 



29 2007 17:31 FR CISTI ICIST 613 952 9303 TO 12122308888 P. 10/43 



lOg R. WU, C. P. BAHL, AND S. A. NARANG 

III. Enzymic Synthesis or Joining of Oligodeoxynucleotides, 
and the Use of Synthetic Oligonucleotides 
for Studying Enzymes 

The chemical synthesis of oligodeoxynucleotides when used in 
conjunction with enzymic methods permits the preparation of longer 
segments and sequence variations with minimal effort. Through the 
use of these oligonucleotides, a great deal has been learned about the 
mechanism of enzymic action. 

A. DNA Ligase 

DNA ligase (polynucleotide ligase) obtained from both uninfected 
and T4-phage-infected Escherichia coli catalyzes the covalent joining 
of two molecules of oligodeoxynucleotide or DNA. The 5'-P from one 
molecule is joined to the 3'-OH of another to form a phosphodiester 
bond (67-69). Since it is difficult and time consuming to synthesize 
oligodeoxynucleotides longer than 15-20 units, a strategy by which 
short synthetic oligodeoxynucleotides (usually 8-12 bases in length) 
are joined with the use of DNA ligase has been devised (4, 70-72) (see 
Fig. 4). This method was used to construct a double- stranded DNA 
molecule approaching 200 base-pairs in length that included the E. 
coli precursor tyrosine tRNA gene, together with its promoter and 
termination regions. A minimum of four overlapping nucleotide pairs 
are required for the joining of oligodeoxynucleotides with DNA 
ligase. 

A second type of DNA ligase activity, which joins even-ended 
double-stranded DNA molecules, was discovered as a result of the use 
of T4 DNA ligase for joining synthetic oligodeoxynucleotides of de- 
fined lengths and sequences (73, 74). This "blunt-end" joining of DNA 
molecules works only for T4 DNA ligase, and does not require over- 

5 ' I 1 1 [ I 1 1 I 1 1 1 [ 3 ° H p 'l 1 I I I I I I I 1 M 1 ll' <* 

™ 3 '-H-H H+-H 5 . 

in 

Fig. 4. The joining of short oligodeoxynucleotides with DNA ligase. Each short 
vertical bar represents a nucleoside. The horizontal lines represent the phosphodiester 
backbone of the oligodeoxynucleotides. In this scheme, the 3'-OH group of segment II 
is joined to the 5'-P group of segment I using DNA ligase. Segment III serves as a 
template for holding segments II and I in adjacent position by base-pairing. Once the 
joining has been completed, a segment IV can be lined up on the right-hand side of 
segment III and joined to segment III using segment I as a template. This procedure can 
be repeated to build a very long sequence. 
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lapping single-stranded regions. Such ligase activity is markedly 
stimulated by T4 RNA ligase (75) and has been used for joining DNA 
molecules for molecular cloning ( 1 5, 76, 77). (A more extensive discus- 
sion on the use of synthetic oligodeoxynucleotides for cloning appears 
in Section VII.) 

B. DNA Polymerase 

E. coli DNA polymerase I catalyzes a template-directed "repair" 
synthesis (78-81) that serves to add specific nucleotides to a primer 
DNA. At 5°-8°C, the incorporation reaction stops when the single- 
stranded section is completely repaired (see Fig. 5), and no initiation 
of new chains occurs. Partial repair synthesis occurs when only one, 
two, or three of the four deoxynucleoside triphosphates are used. 

Repair synthesis can be used to complete a chemically synthesized 
structure, as shown in Fig. 5, thereby making the synthesis of the 
21-nucleotide-long duplex easier. More important, partial repair syn- 
thesis on structure I (in Fig. 5) can produce duplex structures of differ- 
ent lengths (e.g., 13, 14, 16, 17, 19, or 21) with relative ease (10). 
Complete synthesis by chemical means of structures of all inter- 
mediary lengths would be extremely time-consuming. In certain 
molecules, repair synthesis with E. coli polymerase failed to incorpo- 
rate the last nucleotides (71). The use of AMV or RSV reverse tran- 
scriptase for repair synthesis resulted in more complete incorporation 
(10). 



OH 



3' 

3' T-T-A-A-C-A-C-T-C-G-C-C-T-A-T-T-G-T-T-A-A 5' 

+ dATP + DBA polymerase I 
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II 



5' 
3' 
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_A-OH 3' 
T-A-T— T-G-T-T-A-A 5' 



+ 4 dSTP + DNA polymerase I 
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T-A-T-T-G-T-T-A-A 5 ' 



Fig. 5. Repair synthesis of the single-stranded end of a lac operator DNA. Structure 
I represents a 12-nucleotide-long primer hybridized to a 21-nucleotide-long template. 
When only one deoxynucleoside triphosphate is used for repair synthesis, partial syn- 
thesis occurs. This addition of one molecule of pdA gives structure II (a 13-nucleotide 
duplex). When all four deoxynucleoside triphosphates are used, the repair synthesis is 
complete and a fully double-stranded DNA molecule is formed (structure III). 
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Another advantage of repair synthesis is the capability to introduce 
modified bases at specific locations. For example, if structure II is 
incubated with 5-Br-dUTP (9) and DNA polymerase I, a 5-Br-dU will 
be incorporated into position 14 in place of a dT. If 4-S-dTTP (82) is 
used as the substrate in place of dTTP, structure II (Fig. 5) will contain 
a 4-S-dT at position 14. The effect of these specific base changes on the 
biological properties of the DNA can then be tested. 

C. RNA Ligase 

T4 RNA ligase joins RNA chains in the absence of a template (83). 
This enzyme also catalyzes the end-to-end joining of single-stranded 
DNA, a reaction confirmed with the use of synthetic oligodeoxynu- 
cleotides (84). The lack of a template requirement makes RNA ligase an 
attractive reagent for the synthesis of oligodeoxynucleotides with de- 
fined sequences. RNA ligase catalyzes the end-to-end ligation of 
(dT) 

12-18 efficiently. However, the rate of this reaction was found to 
diminish with decreasing chain length of the substrate (84). 

D. Polynucleotide Phosphorylase 

E. coli polynucleotide phosphorylase catalyzes the addition of a 
few deoxyribonucleotides to the 3'-OH end of an oligodeoxynu- 
cleotide primer (85). Deoxynucleoside S'-diphosphate, Mn 2+ , and a 
primer at least three nucleotides in length are required for the reaction 

(86) . This reaction has been used for the stepwise addition of nu- 
cleotides to a d(pT-T-A-G) primer to synthesize a tridecanucleotide 

(87) , d(pT-T-A-G-C-A-G-A-A-C-C-G-G), in six steps, as shown in Fig. 6 
(1977). In some of the steps, two nucleotides of the same type are 
added in a single reaction. Because of the large number of steps in- 
volved, the overall yield of the final product is rather poor. This 
method is perhaps more useful in synthesizing variations of a chemi- 
cally prepared sequence: for example, a specific nonanucleotide can 
be converted to four different decanucleotides with A, G, T, or C at the 
3' end. 

E. Other Enzymes Studied with the Aid of Synthetic 
Oligodeoxynucleotides 

A synthetic octanucleotide, d(A-C-C-A-T-C-C-A), was employed to 
develop a useful method for labeling the 3' ends of oligodeoxynu- 
cleotides or DNA. Terminal transferase was used (88, 89) to catalyze the 
addition of a single [ 32 P]ribonucleotide to the 3' end of the octanu- 
cleotide to give d(A-C-C-A-T-C-C-A) 32 p-rN, where N is any one of the 
four nucleosides. The 3'-labeled oligonucleotide or DNA can then be 
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Fig. 6. Enzymic synthesis of oligodeoxynucleotides of defined sequence. * This 
pentanucleotide was purified on a DEAE-cellulose column to remove the diaddition 
product, d(pT-T-A-G-C-C), and the starting material. The same column purification was 
used after each step: (2), (3), (4), (5), and (6). In order to improve the yield, the unreacted 
primer was isolated at each step and recycled in one or several repeated synthetic steps. 
| This tridecanucleotide constitutes a segment of the yeast iso-cytochrome c gene. 

used directly for sequence analysis. Alternatively, it may be used as a 
primer for further extension with deoxynucleotides. Subsequently, the 
extended segment may be analyzed after its cleavage from the primer 
at the ribonucleotide linkage (90-92). 

The mechanism of action of E. coli exonuclease III was studied by 
hybridizing (dA)iooo with oligothymidylate labeled at the 3' end with 
one, two, or more ribonucleotides. It was found that exonuclease III 
can cleave one or two consecutive ribonucleotides regardless of 
whether the ribonucleotides are base-paired or mismatched (93). 

Oligodeoxynucleotides of defined sequences such as d( 32 pA-C- 
A-A-T-T) and d( 32 pA-A-T-T-G-T-G-A-G-C-G-G) have been used to 
study the 3' -» 5' exonuclease activity of E. coli DNA polymerase I. It 
was found that at pH 7.1 and 5°C, DNA polymerase I does not digest 
d(pA-C-A-A-T-T) but degrades longer oligodeoxynucleotides as well 
as hexathymidylate to tetranucleotides. This indicates that digestion is 
dependent on both the size and sequence of the oligonucleotides. At 
15°C or higher temperature, every oligonucleotide tested (hexanu- 
cleotides or longer oligonucleotides) produced a terminal 5'- 
dinucleotide as the major product (Bahl and Wu, unpublished). 

IV. Synthetic Oligodeoxyribonucleotides and DNA 
Sequence Analysis 

A. Primer Extension Method for DNA Sequence Analysis 

The sequence of a DNA molecule from a specific location can be 
determined by either the end-labeling approach or the primer- 
extension approach. 

1. The End-Labeling Approach 

For sequence analysis from the 5' end of a DNA molecule, the 
5'-OH group can be labeled with 32 P using [y- 32 P]ATP and 
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polynucleotide kinase (94, 95). After labeling, the two 5' ends of a 
double-stranded DNA can be separated either by strand separation or 
by cutting the DNA molecule with a restriction enzyme, followed by 
separation of the two ends (17, 25). Each purified 5'-end-labeled DNA 
can be sequenced using one of three methods: exhaustive digestion 
with pancreatic DNase I plus venom phosphodiesterase, or E. coli 
exonuclease I, to determine the sequence of mono- or dinucleotides 
(17, 25, 94, 96, 97); partial enzymic digestion followed by mobility 
analysis to determine sequences up to 15 nucleotides in length (39, 
98-100); or partial chemical digestion followed by gel electrophoresis 
to determine up to 120 nucleotides from the labeled 5' end (101). 

For sequence analysis from the 3' end of a DNA molecule, the 3' 
end can be labeled by introducing a single deoxynucleotide (102-105) 
or a single ribonucleotide (89, 106-108). Sequence analysis can then 
proceed from the labeled 3' end using the same principles described 
for analysis from the 5' end. 
2. The Primer-Extension Approach 

For sequence analysis starting from an internal portion of a long 
DNA molecule, specific primers can be used. The principle for the 
primer-extension method described by Wu et al. (81,109) is illustrated 
in Fig. 7. To analyze the DNA sequence from a specific location, such 
as nucleotide 120, an oligodeoxynucleotide primer is first hybridized 
onto the template strand starting at nucleotide 120, and then enzymi- 
cally extended toward nucleotide 240 by repair synthesis (102, 110- 
113). By determining the sequence of the extended region (dotted 
line), the sequence of the complementary template strand can be de- 
duced. 

The oligodeoxynucleotide primer of defined sequence can be syn- 
thesized chemically according to DNA sequences derived from sev- 
eral sources: a known protein sequence coded by the specific region of 
a gene (39, 109, 110, 114), a known RNA sequence (72, 111, 115-117), 
or a short known DNA sequence (90, 118, 119). 
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Fig. 7. Primer-extension approach for DNA-sequenee analysis. 
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Fig. 8. The plan for sequencing the promoter region of the tyrosine tRNA gene. The 
1-strand of 080 psu + uiDNA was used as the template strand for primer elongation. A 
synthetic 21-nucleotide-long primer was used to prime DNA synthesis by DNA 
polymerase. With this primer, a sequence up to nucleotide 39 in the promoter region 
was determined. Then a second synthetic primer in the newly sequenced region was 
used to determine the rest of the sequence up to nucleotide 59 in the promoter region 
(143). 

A synthetic octanucleotide, d(A-C-C-A-T-C-C-A), was used as a 
primer to develop a primer-extension method for DNA sequence 
analysis (110). This method involves the enzymic incorporation of a 
ribonucleotide at specific locations (ribo-substitution method) to facili- 
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FlG. 9. The use of DNA fragments as primers for sequence analysis. In step I, a 
double-stranded DNA molecule is digested by a specific restriction endonuclease to 
produce five specific DNA fragments. The fragments are separated by gel elec- 
trophoresis according to size. In step II, the two strands of a double-stranded DNA are 
separated. For sequence analysis of segment A, the lower strand is used as the template 
and is hybridized with heat-denatured fragment C. Enzymic extension of fragment C by 
repair synthesis will allow sequence analysis of the initial part of segment A {e.g., 
nucleotides 1-120). Steps III and IV, see Section IV, A, 2 in text. 
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tate the analysis of long sequences. Using fd DNA as template and the 
synthetic octanucleotide as primer, a sequence of 50 nucleotides was 
determined. 

Synthetic oligodeoxynucleotide primers corresponding to the E. 
coli tyrosine tRNA sequence were employed for sequence analysis. 
Using the separated strands of 080 pSu^ DNA as template and syn- 
thetic primers, the sequence at the promoter region (90) of the tyrosine 
tRNA gene was determined as shown in Fig. 8. 

Specific DNA fragments generated by the restriction endonuclease 
digestion of a DNA have proved to be useful primers for DNA se- 
quence analysis (120-122). Sanger et al. thus achieved the sequencing 
of <f>X 174 DNA, a monumental task involving approximately 5000 nu- 
cleotides (122-124). 

The method described by Sanger and Coulson (113) can be used to 
determine up to 120 nucleotides starting from a restriction enzyme 
recognition site (such as the C/A junction shown in Fig. 9). If fragment 
A is longer than 300 nucleotides and has no other known restriction 
enzyme recognition sites, then the center region of fragment A can be 
analyzed by synthesizing a decanucleotide primer corresponding to 
the known DNA sequence (e.g., nucleotides 111-120 in Fig. 9, step 

III) . Using this synthetic primer, the sequence of fragment A from 
nucleotide 120 to approximately 230 can be determined. Nucleotides 
230-340 can be determined from the opposite direction by using the 
upper strand as the template and fragment D as a primer (Fig. 9, step 

IV) . 

The DNA molecule used for primer-binding and extension may be 
a single-strand DNA isolated from a double-stranded DNA (101, 115, 
120, 125), a double-stranded DNA made partially single-stranded by 
E. coli exonuclease III digestion (38, 39), or a single-stranded circular 
DNA (110, 121, 123). 

B. Synthetic Oligodeoxynucleotides as Tools for Developing New 
Methods for DNA Sequence Analysis 

A synthetic decanucleotide, d(A-G-A-A-A-T-A-A-A-A), was utilized 
as a primer for developing a rapid new method for DNA sequence 
analysis, commonly known as the "plus-minus" method (113) (Fig. 
10). DNA polymerase I is first used to extend the primer oligonu- 
cleotide and copy the template in the presence of the four deoxynu- 
cleoside triphosphates, one of which is labeled with 32 P. This synthesis 
produces oligonucleotides of different lengths, all starting from the 
3'-OH end of the primer. The mixture is then passed through a small 
agarose column to remove the deoxynucleoside triphosphates. The ex- 
tended primers are subsequently treated in one of two ways. 
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-C-T-A-G+C+T+A+G 



FlG. 10. The principle of the 

am. 



'plus-minus" method of DNA sequence analysis 



1. The Minus System 

This system utilizes the partial-repair principle introduced for 
determining the sequence of "sticky" ends of bacteriophage A (79). 
The random mixture of oligonucleotides, still hybridized to the 
template DNA, is incubated with DNA polymerase I in the presence 
of three deoxyribonucleoside triphosphates. Synthesis then proceeds 
as far as it can on each chain and stops when points are reached where 
the missing triphosphate is required. Thus, if dATP is the missing 
triphosphate, each chain will terminate at its 3' end at a position pre- 
ceding a dA residue. Four separate samples are incubated, each with a 
different one of the four triphosphates missing. 

DNA molecules in the four incubation mixtures are then denatured 
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to separate the newly synthesized strands from the template, and sub- 
jected to electrophoresis on polyacrylamide gels containing 8 M urea. 
This fractionation system arranges all the oligonucleotides having a 
common 5' end according to chain length. Ideally, each oligonu- 
cleotide should be separated from that neighbor differing in length by 
one nucleotide. The radioautograph from the minus-dA mixture con- 
tains bands corresponding to positions preceding the dA residue in the 
synthesized chain. Reactions involving the other three minus systems 
are examined in a similar fashion. By comparing the positions of the 
bands on the radioautographs of the four mixtures, the complete DNA 
sequence may be deduced. Since this system alone is not sufficient to 
establish the sequence unambiguously, a second system is used in 
conjunction with it. 

2. The Plus System 

The "plus" system makes use of the property of T4 induced DNA 
polymerase to degrade double- stranded DNA from its 3' end. By the 
addition of a single deoxynucleoside triphosphate, the degradation is 
effectively stopped. This is due to the constant replacement of a nu- 
cleotide in the DNA by incoiporation of the added nucleotide (103, 
126). This method is applied to the random oligonucleotide mixture 
obtained above. Samples are incubated with T4 DNA polymerase and 
one of the four triphosphates and then fractionated by electrophoresis 
on polyacrylamide gel. For example, in a plus-dA mixture, all chains 
will terminate with a dA. The position of the chains will be indicated 
by bands on the radioautograph. Usually these bands will be one nu- 
cleotide longer than the corresponding bands in the minus-dA system. 
If there is more than one consecutive dA, the distance between the 
bands in the minus-dA and plus-dA system will indicate the number 
of such nucleotides. Thus, by a combined use of the plus and minus 
systems, it is possible to deduce the sequence of a molecule more than 
one hundred nucleotides in length starting from a synthetic primer. 
Using this method together with fragments generated by restriction 
enzymes, a large portion of the genome of 0X174 has been determined 
(122-124). 

C. Synthetic Oligodeoxyribonucleotides as Standards for 
Developing Methods for DNA Sequence Analysis 

The 2-D (two-dimensional) electrophoresis-homochromatography 
method (127-1 29) has been extensively used for the fractionation of Tl 
and pancreatic ribonuclease digestion products of RNA molecules. 
This method has also been used for the sequence analysis of 
pyrimidine tracts in DNA up to 20 nucleotides long (98), and sub- 
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sequently has been adapted for sequence analysis involving all four 
nucleotides. However, owing to the overlap between mobility shifts 
exhibited by different nucleotides (Fig. 11), and the subjectivity of 
visual inspection, it is often difficult to assign the correct sequence. To 
solve this problem, a quantitative mobility- shift method using syn- 
thetic oligonucleotides of defined sequences as standards has been 
developed (100). Although the recently developed chemical method 
of Maxam and Gilbert (101) and the plus-minus method of Sanger and 
Coulson (113) can determine a long DNA sequence in a relatively 
short time, the 2-D electrophoresis-homochromatography system is by 
no means obsolete. This system is especially useful for sequencing the 
terminal five to ten nucleotides of long DNA fragments that are often 
difficult to sequence by the chemical method (101), and for determin- 
ing short sequences at restriction endonuclease recognition sites (108, 
130). 

1. The Qualitative 2-D Fractionation System 

For the fractionation of oligoribonucleotides by the 2-D method 
(127-129), the first dimension is electrophoresis on a cellulose acetate 
strip at pH 3.5, which separates nucleotides on the basis of charge. 
The second dimension is homochromatography on DEAE-cellulose 
thin-layer plates, which separates nucleotides on the basis of length. 
As shown in Fig. 11, the addition of pdC in this system (98) causes a 
shift to the right. The addition of a pdA causes no or only a slight shift 




Fig. 11. The usual pattern of mobility- shift of an oligodeoxyribonucleotide (n) ex- 
tended by a single nucleotide on two-dimensional (2-D) electrophoresis- 
homochromatography. The first dimension is electrophoresis on cellulose acetate at pH 
3.5, and the second dimension is homochromatography; SR and 8Y represent the dis- 
tances for addition of purine nucleotides and pyrimidine nucleotides, respectively, 
fractionated in the second dimension. 
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to the left or right, while the addition of pdT or pdG causes a larger 
shift to the left. 

This method has been used extensively for sequence analyses of 
oligodeoxynucleotides. However, in many cases the visual inspection 
of shifts did not give conclusive results. Unambiguous sequences were 
obtained only after confirmation by other methods, such as end-group, 
nearest-neighbor, or pyrimidine-tract analysis. To convert it into an 
independent quantitative tool, the mobility-shift method was im- 
proved by the use of synthetic oligonucleotides of defined sequence to 
establish quantitative parameters (100). The major improvement was 
in the way the electrophoretic mobility shifts are interpreted. Instead 
of relying on visual inspection, the idea of calculating expected 
mobilities was developed. By comparing the observed with the calcu- 
lated mobility-shifts for various oligonucleotides, a sequence could be 
deduced without ambiguity. We have also modified the homo- 
mixtures (131) to make more clear the distinction between purine and 
pyrimidine shifts in the second dimension of this 2-D method. 

2. The Quantitative 2-D System for Sequence Analysis (100) 

The calculation of electrophoretic mobilities was based on the fol- 
lowing considerations. A molecule in a fluid subjected to a voltage 
gradient E moves with a velocity U = EQ/K, where Q is the net charge 
on the molecule at a given pH, and K is a constant that increases 
proportionally to the size of the molecule. All mobilities were ex- 
pressed with respect to an internal standard of [ 14 C]pdT, which has a 
charge of -1 at pH 2-5. The electrophoretic mobility of an 
oligodeoxynucleotide on cellogel at pH 3.5 relative to pdT can be 
written as U t = QIK. The average K values (K) for every length of 
nucleotides, n, was determined experimentally with the use of a 
number of different synthetic oligodeoxynucleotides of defined se- 
quence. Each 5'- 32 P-labeled oligomer was partially digested with 
venom phosphodiesterase. The products of digestion were then frac- 
tionated on the 2-D electrophoresis-homochromatography system 
under identical conditions. As the base composition and sequence of 
each partial product was known, the net charge Q = % n q could be 
determined (where q is the charge of each mononucleotide). The 
observed mobility was calculated by the following relation (100, 132): 

rjous _ distance traveled by oligomer in the first dimension 
7 distance traveled by pdT in the first dimension 

Since both U t and Q could be measured for any given oligonu- 
cleotide, the value of K n (n = 1,2,3, ■ • ■) could be obtained through 
the relation K n = QHJ t . By determining the K n values for each oligonu- 



Board Assigned Page #2022 



JAN 29 2007 17:37 FR CISTI ICIST 

T 



613 952 9303 TO 12122308888 P. 21/43 



JARANG 

. larger 

yses of 
pection 
;s were 
-group, 
into an 
'as im- 
ence to 
?nt was 
Instead 
:pected 
3 calcu- 
Duld be 
homo- 
ine and 

:s (100) 

the fol- 
voltage 
: charge 
icreases 
r ere ex- 
;h has a 
of an 

can be 
:ngth of 
.se of a 
ned se- 
3d with 
len frac- 

system 
_ience of 
:ould be 
le). The 
X), 132): 

i 

oligonu- 
through 
oligonu- 



SYNTHET1C OLIGODEOXYNUCLEOTIDES 



119 



hi 
i.i? 

1.40 
1,11 
1,19 
1.20 



1.22 
1.G9 

1,06 
1.11 
1.22 



1.33 

1.3-3 
1 . 3-S 
1.23 
109 
1.15 



1,00 
1.13 
1.27 



0.02 

-0.01 

C.0» 
0.12 

-0.01 
0,02 

-O.04 
0.!3 
0.Q1 

-0.05 

-O.I1 

0.2ft 



-0,02 

0.12 

0. IU 
-0.06 

O.02 
-O.06 

O. 14 



-0. 13 

-0.14 



• u 

» 13 

• 12 



• n 



• 9 



• 7 



• 7 



FlG. 12. Two-dimensional electrophoresis-homochromatography for sequence 
analysis. The 14-mer, d( 32 pA-G-T-C-C-A-T-C-A-C-T-T-A-A), was partially digested by 
snake venom phosphodiesterase and electrophoresed on cellulose acetate at pH 
3.5. The second dimension is on DEAE-cellulose thin-layer plate with homo-mixture IV 
(131). The mobility shift is calculated according to Tu et al. (100). pT, etc., mark the 
location of 14 C-labeled nucleotides used as markers. 



cleotide up to a length of 15 nucleotides, the average K n value (KJ for 
each n was obtained (100). Thus, using these average values for K n , the 
mobility of any oligonucleotide can be calculated. From the values, 
the predicted mobility-shift between two adjacent nucleotides n and 
n + 1 differing by one mononucleotide unit can be calculated through 
the relation S calc = U TM+1) - V c t}h- The mobility-shift is characteristic 
of the mononucleotide unit added to an oligomer of length n to give 
oligomer of length n + 1. By comparing the observed mobility- shift 
value, S ot)S ~U^ n+u - U^, with the S calc for the addition of any one of 
the four mononucleotides, the sequence of the oligomer can be unam- 
biguously derived. An example of the mobility- shift calculation for 
sequence analysis is shown in Fig. 12. Quantitative mobility- shift 
analysis has been used extensively in the authors' laboratory for the 
sequence analysis of tumor virus DNA (108, 132a). 

V. Synthetic Oligodeoxyribonucleotides and 
the Study of Genes 

A. Total Synthesis of Genes 

The most extensive work in the field of synthetic oligonucleotides 
has been done in Khorana's laboratory. One of his efforts has been 
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directed toward the synthesis of biologically functional DNA, i.e., 
genes capable of giving rise to gene products. This work has resulted 
in the development of a large number of techniques in chemical and 
enzymic synthesis of DNA. During the past decade, Khorana et al 
have reported the synthesis of two transfer RNA genes, one for the 
yeast alanine tRNA and the other for the E. coli tyrosine tRNA (4, 5). 

Work on the synthesis of the yeast alanine tRNA gene (4) was initi- 
ated in 1965. The choice of this gene for synthesis was based on the 
fact that it was the first tRNA whose sequence was known (133). Thus, 
the sequence of the deoxynucleotide in the gene could be derived 
directly. Moreover, tRNA molecules are interesting in that they are 
recognized by a large number of components of the protein- 
synthesizing machinery. These include aminoacyl-tRNA synthetases, 
the nucleotidyl transferases that repair the C-C-A end of the tRNAs, 
the ribosomes, and RNAs. The synthesis and study of the tRNA gene 
would serve to provide useful knowledge about its structure and func- 
tion. The strategy involved in the synthesis of the 77-base-pair-long 
duplex DNA structure (Fig. 13) involved three steps, (a) Chemical 
synthesis of 15 oligodeoxynucleotide segments ranging from 8 to 20 
nucleotides in length. These segments were to comprise both strands 
of the intended DNA. Those belonging to the complementary strands 
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Fig. 13. Plan for the total synthesis of a yeast alanine tRNA gene. The chemically 
synthesized segments are in brackets, the serial number of each segment being shown 
within the brackets. A total of seventeen segments (including 10' and 12') varying in 
chain length from penta to icosanucleotides were synthesized. 
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i.e., had an overlap of four to five nucleotides, (b) The phosphorylation of 

ilted the 5'-hydroxyl group with [ 32 P]ATP, using T4 polynucleotide kinase, 

and ( c ) The head-to-tail joining of the properly aligned segments to form 

if al. bihelical complexes, using the T4 polynucleotide ligase. 

r the After five years of intensive work, the total synthesis of yeast 

.5). alanine tRNA gene (4) was achieved. Although the synthesis was a 

initi- great accomplishment and a tribute to the chemical methodology and 

i the enzymology developed through this process, attempts to do any 

;hus, biological work with the artificial gene were not successful, owing to 

rived its l ac k °f sufficient information for the desired in vitro transcription to 

y are make alanine tRNA, the gene product. Instead, the E. coli DNA- 

,tein- dependent RNA polymerase made only limited amounts of the tran- 

rases, script, which initiated and terminated nonspecifically. 

,NAs, The second gene synthesized by Khorana et al. was the E. coli 

gene tyrosine tRNA gene. The major consideration favoring synthesis of this 

fane- § ene was the relative abundance of information available for the E. 

-long c °h system. This included an understanding of the cell-free protein- 

mical synthesizing system, the biochemistry of the ribosomes, 1 and the van- 

to 20 ous factors required for initiation, elongation, and termination of 

rands polypeptide chains. 2 In addition, the aminoacyl-tRNA synthetase 

rands corresponding to the tyrosine tRNA (134) had been purified and 

characterized, and the primary sequence of this tRNA determined by 
two independent methods (135-138). Extensive genetic and biochem- 
ical analysis of this gene has been carried out (139-141). Using the 
" Al lysogenic bacteriophage $80, which contains the E. coli tyrosine tRNA 

suppressor gene, and using suppression as the assay, a large number of 
mutants involving the tyrosine tRNA gene were isolated. The nature 
of these mutations and changes in function were studied. The tyrosine 
« i"! ! tRNA exists in vivo as a 126-nucleotide-long precursor containing a 5' 

| triphosphate. This precursor is then processed to give the mature 

j tRNA (142). z 

} \ The plan for the chemical synthesis of tyrosine tRNA gene in- 

*. [ci eluded the sequences shown in Fig. 14. A total of 26 fragments, 4-12 

nucleotides long, were synthesized. These fragments were joined to- 
gether by T4 DNA ligase in a stepwise fashion to get the 126 base- 
o . pair-long duplex DNA coding for the precursor tRNA. Single-stranded 

w . [n ! tails at the ends of the synthetic DNA were included to allow for 

m 

mically 1 See articles by Brimacombe et al. in Vol. 18, and by Bermek and by Spirin in this 

r s hown volume (Ed.) 

rying in 1 See article by Grunberg-Manago and Gros in Vol. 20 of this series (Ed.) 

3 See article by J. D. Smith in Vol. 16 of this series. 
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FlG. 14. Plan showing the 26 segments synthesized chemically in the total synthesis 
of the bihelical DNA corresponding to the precursor for the Escherichia coli tyrosine 
suppressor tRNA gene. 



-/a AGGGGGTGGTGGTAGTGAAAGTTTTCAGGCTTTCTTAA 
r G. vttccccc ACCACCATCACTTTCAAAAGTCCGAAAG r 



t C' Gc TTGGAAGCTTCAGCTACTGCCGTCTAAATCTC A g"^ 
^ A ACCTTCG AAGTCG AT6ACGGC AG AT T T A G AGTC T ^,,C_ 



c GTAATGGGCACCACCCCAAGGGCTCGCCGGTTTC, 
T^'aGCATTACCCGTGGTGGGGTTCCCGAG CGGCCAA a g > 



;-CCGGACGAGGGAATAGCCCTTCGCCCCGCGTAGTA t 
Ig^CCTGCTCCCTTATCGGGAAGCGGGGCGCATCATa ,a 




GAAAGAGTTGCATTGTGAAATGTCGCCGCGCAGT 'jj 



}■ A ATTCTTTCTCAACG taacactttaca gcggcgcgtc 

I ""- » » » 



Fig. 15. The sequence of the Escherichia coli tyrosine suppressor tRNA gene con- 
taining the promoter and the terminator region, synthesized chemically. 
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extension to the regions comprising the promoter and' the terminator 
*'! region of the genes. 

j ) To make the synthetic gene a functional unit, the sequence of the 

promoter region (56 base-pairs) and the processing terminator region 
s ., (25 base-pairs) was determined (143) by the primer-extension method, 

i using the 080 pSufo DNA as a template. Finally, the fragments corre- 

sponding to these two control regions were synthesized and joined to 
the structural gene. Thus, the total synthesis of the gene, complete 
with its promoter and terminator regions, was achieved (Fig. 15) (40). 
s > To test for the biological activity of this gene, EcoRl endonuclease 

recognition sequences were added to both ends of the gene for cloning 
r , ! into E. coli. Biological activity was assayed by the ability of the cloned 

gene to suppress amber mutations in vivo. Integration of the synthetic 
gene into the A. phage genome gave it the ability to form plaques on 
Su~ host bacteria. The presence of the E. coli tyrosine suppressor 
5 ' tRNA gene in the recombinant phage was confirmed by its reisolation 

151 from phage DNA after amplification. The synthetic gene was also in- 

corporated into the colicin Ej ampicillin-resistance plasmid. The re- 



(»') combinant plasmid transformed bacteria for both Amp r and 

(si suppressor-plus (su + ) markers with approximately equal efficiency 

(144). 

^thesis In addition to the synthesis of tRNA genes, the DNA coding for 

y f0sine biologically active peptides have been synthesized in various labora- 



tories, such as the synthesis of oligodeoxynucleotides corresponding 
to the bovine insulin chain A (145), to the S-peptide of bovine pan- 
creatic ribonuclease A (6), and to the human peptide hormone an- 
giotensin II (7). Attempts to translate these synthetic DNA molecules 
into peptides are in progress. If successful, they may help in produc- 
ing useful peptide products via DNA synthesis. 

B. Synthetic Oligodeoxyribonucleotides as Probes for Genes 

Synthetic oligonucleotides can also be used as specific probes for 
studying various aspects of gene structure and function. Synthetic 
oligonucleotides corresponding to portions of several genes have been 
synthesized, e.g., k endolysin (39), T4 lysozyme (35-38), and yeast 
iso-l-cytochrome c gene (87, 117). These oligonucleotides can be used 
as primers for determining the primary DNA sequence by the primer 
extension method (see Section IV, A). They can also be used for isolat- 
ing specific mRNA and for locating the respective genes by the hy- 
bridization method of Southern (146). 

Detailed study of the yeast iso-l-cytochrome c gene, using a syn- 
thetic oligonucleotide as a probe, is in progress in the author's labora- 
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tory. A pentadecadeoxynucleotide (15-mer) primer, 5'd(A-G-C-A-C- 
C-T-T-T-C-T-T-A-G-C), corresponding to amino acids 8 to 12 of yeast 
iso-l-cytochrome c, has been synthesized. Using the mRNA as a 
template and extending the primer by DNA-polymerase-catalyzed 
repair synthesis, a sequence of 20 nucleotides at the 5' end of 
the AUG initiation codon was obtained (Szostak, Stiles and Wu, 
unpublished). 

The 15 -mer has been chemically linked to cellulose an im- 
mobilized support, with the condensing agent dicyclohexylcar- 
bodiimide, and used to purify and isolate the iso-l-cytochrome c 
mRNA (Szostak et al, unpublished). A hexanucleotide, d(A-C-C-A- 
G-C), which corresponds to the two major codons of Bombyx mori 
fibroin mRNA, GCU (alanine) and GGU (glycine), has been synthe- 
sized (147). An affinity column made of polymerized hexanucleotide 
will preferentially trap the mRNA. The availability of pure mRNA 
should facilitate work in this system. 

A labeled 15-mer corresponding to the cytochrome c gene has also 
been utilized for identifying the restriction fragments containing the 
iso-l-cytochrome c gene. Yeast DNA was digested with either Eco RI 
or Baml restriction enzyme and fractionated on an agarose gel. After 
transferring the DNA from the gel to a nitrocellulose filter (146), 32 P- 
labeled 15-mer or an enzymically extended 15-mer (cDNA) was used 
as a probe to hybridize with the cytochrome c gene. A band presum- 
ably containing the iso-l-cytochrome c gene was detected on the gel. 
With this synthetic probe, it was found that in yeast with a wild-type 
iso-l-cytochrome gene the EcoRl DNA fragment carrying the gene has 
an M r of 4.5 x 10 6 . A mutant with a mutated EcoRI site in the cyto- 
chrome c gene the size of the EcoRI fragment is about 6 x 10 6 (Szostak, 
Stiles, Sherman and Wu, unpublished). 

The labeled 15-mer has also proved useful in screening plasmids 
for the cytochrome c gene from a colony bank of yeast DNA. Once the 
yeast iso-l-cytochrome c gene is cloned, large amounts of the cyto- 
chrome c DNA can be isolated. For sequence analysis, the synthetic 
15-mer functions as a primer to be annealed with the cloned cyto- 
chrome c DNA (strand separation may or may not be necessary) and 
extended with the use of E. coli DNA polymerase I. Then either the 
method of Sanger and Coulson (113) or that of Maxam and Gilbert 
(101) can be used for sequence analysis. 

Using the same principle as in the probing and analysis of the yeast 
cytochrome c gene (117), synthetic oligonucleotides with defined se- 
quences can be very valuable as hybridization probes for the isolation 
and cloning of single-copy genes in mammalian cells. For single-copy 
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genes that produce appreciable amounts of mRNA (e.g.' more than 
10% of the total cellular mRNA), the specific mRNA can be purified, 
copied into cDNA and double- stranded DNA, and then cloned (148, 
149). However, for genes that produce small amounts of mRNA (less 
than 1% or 0.1% mRNA of the total mRNA) the above method is not 
usable. There are two approaches to the cloning of these single-copy 
genes. An oligonucleotide approximately 18-20 nucleotides in length 
can be synthesized and used as a primer and hybridized to the in- 
tended specific mRNA in a mixture of mRNAs from a mammalian cell. 
Following enzymic extension of the synthetic primer, the resulting 
cDNA can be made double- stranded and cloned (148, 149). Alter- 
natively, a given DNA may first be digested by a restriction enzyme 
and fractionated on an agarose gel. The synthetic oligonucleotide may 
then be used as a probe for hybridization with a specific gene in the 
original DNA. Next, the DNA can be cloned, and the transformed cells 
may be screened for the desired gene by hybridization with the same 
synthetic probe (150). 



VI. Synthetic Oiigodeoxyribonucleotides for Studying 
Protein-DNA interactions 

One of the most basic problems in molecular biology is under- 
standing protein-nucleic acid interactions; i.e., how do the specific 
amino acids of proteins interact with the nucleotide sequences of the 
DNA? The sequences of a number of interesting segments of DNA 
such as operator, promoter, and restriction endonuclease recognition 
sites have been elucidated in recent years. For studying the point-to- 
point interaction of nucleic acids with proteins, large amounts of the 
DNA fragments are required. DNA fragments may be isolated follow- 
ing suitable restriction enzyme digestion, incorporation into bacterial 
plasmids, and amplification. Alternatively, they can be chemically 
synthesized using one of the methods discussed in Section II. 

The majority of the DNA fragments involved in protein binding are 
relatively short and may be synthesized in large quantities. Chemical 
synthesis followed by enzymic synthesis or chemical synthesis alone 
provides several advantages over the isolation of DNA by restriction 
enzyme digestion of natural DNA: (a) one can synthesize a DNA of 
desired length without the necessity of including any extraneous 
DNA; (b) one can modify the synthesis so that altered or modified 
bases can be introduced at specific locations and their effect on protein 
recognition analyzed; (c) one can obtain larger quantities of a specific 
DNA fragment for physicochemical studies. In this section we discuss 
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a few examples in which synthetic oligonucleotides have been 
utilized for studying protein-nucleic acid interaction. 

A. The Lactose Operator— Repressor System 

During the past decade, the genetics and biochemistry of the lac- 
tose operator-repress or system have been subjected to extensive 
study. The primary sequence of both operator and repressor has been 
determined. Gilbert and Maxam (151) obtained the sequence of the 
lac operator by isolating and sequencing the fragment protected by the 
lac repressor from pancreatic DNase digestion. This 27-nucleotide- 
long duplex fragment has a region of 2-fold symmetry, 21 nucleotides 
long, which was considered to be a significant feature in the specific 
recognition by the repressor protein. 

We have recently extended the studies of this system by the use of 
synthetic oligonucleotides corresponding to the lac operator region. 
This section summarizes only the salient features, since a detailed 
account has been presented elsewhere (24). In the first step, two 21- 
nucleotide-long single- stranded DNAs were chemically synthesized 
by a modified phosphotriester method (9). The two strands were then 
annealed to produce a duplex 21-mer with a sequence identical to that 
of the lac operator region (Fig. 16). This duplex DNA was also synthe- 
sized by the joining of short oligonucleotide fragments with T4 DNA 
ligase, or by using primer extension-repair synthesis (see Section IV, 
A). The in vitro biological activity of the synthetic 21-mer duplex was 
examined by binding it to the lac repressor and assaying the complex 
by the Millipore filter assay. The 21-mer duplex did indeed bind to the 
lac repressor, and the binding was specific (9, 152), as shown by its 
drastic reduction by the inducer iPrSGal (Fig. 17). To test whether the 
synthetic 21-mer duplex could function as an operator in vivo, this 
sequence was introduced into bacterial plasmids by the use of suitable 
synthetic adaptors (see next section). When introduced intoE. coli, the 
hybrid plasmids, now containing the synthetic lac operator, activated 
the /3-galactosidase synthesis (15, 77, 153), which was assayed by stan- 
dard procedures. 

Once it was shown that the 21-nucleotides-long duplex DNA con- | 
tained all the information necessary for recognition by the lac re- 
pressor protein in vitro and in vivo, a number of shorter duplex se- 

■>' AATTGTGAGCGGATAACAATT 
3'TTAACACTCGCCTATTGTTAA 

Fig. 16. Sequence of the chemically synthesized 21-nueleotide lactose operator 
DNA. 
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Repressor (^1) 

Fig. 17. Binding curve of the synthetic 21-nucleotide lac operator DNA to the lac 
repressor. The 5' 32 P labeled lac operator duplex was taken up in binding buffer (9), and 
various volumes of the lac repressor (15 fig/ml) were added. Triplicate samples (30 /xl 
each) were filtered on 13-mm Millipore filters, and the filters were counted (O). Control 
samples contained 1 mM isopropylthiogalactoside (•) to inhibit this specific binding. 

In this experiment (9), only about 10% of the input counts in the duplex operator 
fraction were retained on the filter, for the following reason. The single-stranded 21-mer 
was approximately 90% pure. In annealing to form the 21-mer duplex, 81% duplex of 
the correct size should be formed. However, since each single-stranded 21-mer has 
extensive self-complementary sequences and can form hairpins, and some nucleotides 
may be depurinated, the yield of completely intact 21-mer duplex was approximately 
30%. In a second report (152), the binding was only 4%, owing to the fact that one of the 
single-stranded 21-mer segments was also contaminated with a 15-mer (the precursor of 
21-mer). On the other hand, in more recent experiments (10), the 21-mer duplex was 
constructed by enzymic repair synthesis of synthetic 12-mer or 15-mer annealed to a 
21 -mer. Therefore, there is less chance for the single strands to form self-complementary 
structures. Furthermore, the nucleotides incorporated by repair synthesis are not dam- 
aged. As a result of these improvements, 40% retention of the input 21-mer duplex on 
the filter was observed in the presence of the lac repressor (10). With a half-life of 
approximately 40 seconds, 40% retention is close to the theoretical maximum. 
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quences were synthesized to determine the minimal essential recogni- 
tion length of the lac operator. The shorter lac operator sequences 
were synthesized as shown in Fig. 18. In studying their binding 
efficiencies with the lac repressor, we found that a specific 17- 
nucleotide-long duplex lac operator possessed all the features neces- 
sary for recognition by the lac repressor (10). Finally, the 17-mer se- 
quence was introduced into a bacterial plasmid and shown to function 
in vivo as a lac operator by activating /3-galactosidase synthesis in E. 
coli (10). 

In addition to determining the minimal essential length of the lac 
operator, several sequences with modified bases have been synthe- 
sized^, 11, 12, 24). A detailed study of modified operators of this type 
should yield additional information regarding the roles of various nuc- 
leotides specifically recognized by the lac repressor. 

B. The E. coli Tyrosine tRNA Gene- Promoter System 

Several modified DNA sequences in the promoter region of the E. 
coli tyrosine tRNA gene have been synthesized (1 54) for studying the 
interaction of these DNA molecules with RNA polymerase. Their 
study has been directed toward answering the following questions. (1) 
Do nucleotide sequences at and following the initiation site of tran- 
scription influence RNA polymerase binding or transcription? To in- 
vestigate this, the promoter is joined to parts of the synthetic gene 
below the natural initiation site. (2) The tyrosine tRNA gene promoter 

s' A A T T f, T G A G C G 6 A T A A C A A T T 
3 ' CTCGCCTATTGTTAA 
|» ATP 

A 

|+ CTP 

C A 

r' ATP 



A A C A 

^ UP 

T T A A C A 

Fig. 18. The plan showing the synthesis of lac operator sequence of vaiying length 
by partial repair synthesis using the upper 21-mer as template and the 15-mer as primer. 
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shows an extremely weak binding to RNA polymerase when com- 
pared to bacteriophage promoters. Is this due to the high (G + C)- 
content of the tyrosine tRNA promoter region that precedes the initia- 
tion site? To test this, the structure of the promoter has been modified 
to lower the (G + C)-content and to remove the symmetrical sequence. 
Systematic studies of this kind promise to provide a wealth of informa- 
tion regarding the regulatory mechanism of transcription. 

C. The Restriction Endonuclease System 

Another area of interest in the protein-nucleic acid interaction con- 
cerns the mechanism of restriction enzyme recognition and the 
cleavage of DNA. Restriction endonucleases recognize DNA se- 
quences 4 to 6 nucleotides long and cleave the DNA specifically at 
those sites. A self-complementary octanucleotide, d(pT-G-A-A-T-T- 
C-A) that contains the recognition sequence of the EcoRl endonuc- 
lease has been synthesized^. This octamer can act as a substrate for 
both the restriction endonuclease and the methylase; the cleavage 
and methylation occur at the same location on the DNA. From these 
results, it was concluded that the substrate for the enzyme is a short 
double-helical segment of DNA containing the central hexamer of the 
octanucleotide, and that it is unlikely that any cruciform rearrange- 
ment is required for enzyme activity. 

The Baml connector decanucleotide, d(C-C-G-G-A-T-C-C-G-G), 
not only contains the Baml site and is cleaved by Baml endonuclease 
(15), but also contains the recognition sequence d(C-C-G-G) for 
Hpalll restriction endonuclease. However, the sequences are not 
cleaved by Hpall endonuclease, presumably because the sequences 
are located too close to the termini of the decanucleotide. On the other 
hand, when two Baml connectors are joined by blunt-end ligation 
using T4 DNA ligase, the resulting 20-mer becomes an active sub- 
strate for Hpall (Fig. 19). The 20-mer was found to be degraded by 
Hpall endonuclease at the locations marked by the arrows to produce 
5'- 32 P- labeled 7-mer and 11-mer (Bahl and Wu; Smith, unpublished) 
and by Haelll endonuclease to give the original decamer. Further- 
more, the 20-mer proved to be excellent substrate for M ■ Hpall and 
M - Haelll methylases in the presence of S-[ 3 Hladenosylmethionine 
(H. Smith et al, unpublished). The 20-mer can be methylated at po- 
sitions 8 and 12 using M • Hpa II methylase, or at positions 11 using 
M ■ Haelll methylase. 

Nucleotide recognition sequences for other restriction endonuc- 
leases, such as Hindlll (15, 16) and Pstl (155), have also been chemi- 
cally synthesized. The Baml and Hindlll endonucleases each recog- 
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Haelll 

I 1 

Hpal l Hpal l 

1 ' 1 ! 

1 3 5 8 10 12 
4- + 

5 ' 32 pC-C-G-G-A-T-C-C-G-G 32 pC-C-G-G-A-T-C-C-G-G 
3 ' G-G-C-C-T-A-G-G-C-Cp 3 2 G-G-C-C-T-A-G-G-C-C 32 p 



+ 
12 



10 8 



Fig. 19. Sequence of a 20-nucleotide-long duplex DNA formed by the dimerization 
of two 10-nucleotide-long Barrel connector sequence. This duplex DNA can be cleaved 
by restriction endonuclease Baml, Hpall, and Haelll. 

nize a hexanucleotide sequence in a large DNA molecule and cleave it 
at that site. In synthetic decamers containing the recognition sequence 
for these enzymes, only the Baml endonuclease produces cleavage. 
When either decamer is dimerized or polymerized, both endonuc- 
leases produce cleavage at the sites of recognition sequences. This 
indicates that the minimum size requirements for binding and cleav- 
age in these enzymes are different. It appears that a decamer is too 
small for the Hindlll endonuclease to bind, although it is large enough 
for the Baml endonuclease to bind and cleave (Bahl, Wu and Narang, 
unpublished). 



VII. Synthetic 01 igodeoxy ribonucleotides as Tools in 
Molecular Cloning of DNA 

During the last four years, useful techniques have been developed 
for the in vitro joining of DNA segments to vehicle DNA molecules 
capable of independent replication (156-159). The cloning vehicle 
maybe a plasmid DNA (156, 157), a phage A DNA (1 58, 1 59 ), or an SV40 
DNA(160). After joining the DNA segment to the cloning vehicle, the 
resulting hybrid DNA (known as chimeras) can be used to transform a 
suitable cell. The hybrid DNA can then be selected from among the 
transformed cells, and its expression in terms of DNA replication, tran- 
scription, or translation studied. 

The three essential steps in cloning a DNA molecule are shown in 
Fig. 20. First, the DNA molecule (which contains the fragment to be 
cloned) is cut to a desired length. The best method is cleaving the 
DNA molecule with a suitable restriction enzyme (161) in such a way 
that only one fragment contains the sequence desired for cloning (des- 
ignated as DNA-X in Fig. 20). Next, the DNA fragment is joined to the 
cloning vehicle DNA by means of DNA ligase. To achieve this, match- 
ing cohesive ends must be present at the termini of both DNA-X and 
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FIG. 20. A scheme for producing and joining of DNA(X) to a cloning vehicle and 

I I A 7 by " hGaVy lme - In 5te P ^> both the DNA containing the segment to 
be cloned and the circular vehicle DNA are cut using restriction enzyme I. InTep^b) 

DNA X vehicTe ZaI T ^ 7^ ° NA ^ ° NA ^ In the » 
i X -; ehlcIe DN A hybrid is used to transform a permeable E. coli cell. Note that the 
circular forms represent double-stranded DNA. 

the cloning vehicle. Finally, the hybrid DNA is introduced into func- 
tional living cell, and a suitable method is used for selecting the clone 
containing the specific hybrid DNA. 

We now describe newer methods for improving the second step 
the joining of DNA to the cloning vehicle, by using chemically synthe- 
sized ohgodeoxynucleotides. 

A. A Specific Method Involving Synthetic Cohesive Ends 

A DNA segment for cloning can be chemically synthesized to in- 
clude a protruding single- stranded sequence that corresponds to the 
recognition sequence of a restriction endonuclease. For example lac 
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operator DNA has been synthesized (153) to include a protruding 5' 
d(pA-A-T-T) sequence corresponding to part of the recognition se- 
quence of EcoRI restriction endonuclease (162). A synthetic lac 
operator (21-nucleotide-long duplex) containing a protruding 5' 
d(pA-A-T-T) sequence at each end is shown in Fig. 21. A molecule of 
circular pMB9 plasmid DNA has been cut once at the desired point by 
EcoRI restriction endonuclease to produce a linear pMB9 DNA with a 
protruding 5' d(pA-A-T-T) sequence at each end. Next, H-bonds are 
formed between the protruding sequences of the two types of mole- 
cules (I and II). Finally, they are joined covalently using T4 DNA 
ligase to produce a circular hybrid Jac-pMB9 DNA. This hybrid is 
capable of transforming competent E. coli cells and expressing its 
biological activity in vivo (153). 

Two other investigators have made independent use of the same 
principle. The synthetic E. coli tyrosine tRNA gene and its regulatory 
elements together with protruding 5' d(pA-A-T-T) sequences has been 
joined to plasmid or phage DNA for molecular cloning (40, 154). An 
artificial minigene coding for part of RNase, which consists of two 
decadeoxynucleotides containing the same protruding sequence, is 
being joined to a vehicle DNA for cloning (163). 

In all three examples, the protruding restriction enzyme recogni- 



5' pA-A-T-T-C — G 3' 

3 ' G _ C-T-T-A-Ap 5 ' 

+ 



Circular EcoR I 5' pA-A-T-T . 



II 



P MB9 DNA ^3' __ T-T-A-Ap 



G-A-A-T-T 

I 
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FIG. 21. Joining the chemically synthesized lactose operator duplex containing pro- 
truding 5'd{pA-A-T-T) sequence to EcoRI endonuclease-cut pMB9 DNA. Structure I 
represents the synthetic lac operator to be cloned, a 21-nucleotide-long duplex contain- 
ing protruding 5' d(pA-A-T-T) sequences at each end. Structure II represents a linear 
pMB9 plasmid DNA to be used as the cloning vehicle. It is formed by cutting a circular 
pMB9 DNA using EcoRI endonuclease, and also contains a protruding 5' d(pA-A-T-T) 
sequence at each end. Structure III represents a circular hybrid iac-pMB9 DNA formed 
by joining structures I and II at their 5' d(pA-A-T-T) ends with T 4 DNA ligase. The 
arrows mark the positions where phosphodiester bonds are formed. 
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tion sequence is a part of the synthetic gene or genetic element and 
can thus only be used for the cloning of the designated DNA. 

B. A General Method Involving Connectors 

A more general method has been developed (15, 77) in which 
chemically synthesized connectors (or linkers) are used to create 
cohesive ends at the termini of blunt-end DNA molecules. Thus, any 
double-stranded DNA molecule can be cloned. The principle behind 
the method is shown in Fig. 22. In the first step, a blunt-end DNA 
molecule such as molecule A is joined end-to-end to a synthetic 
linker molecule (such as Baml linker B) using the blunt-end ligation 
activity of the DNA ligase (73, 74). The resulting molecule, with a 
linker molecule added to each end of the original, is shown in C. 
Next, this molecule is digested by the suitable restriction enzyme (in 



CD) 



ii 



CD') 



5' pA-A-T-T-G-T-G~A-G-C-G-G-A-T-A-A-C-A-A-T-T + 5' pC-C-G-G-A-T-C-C-G-G 
3' T-T-A-A-C-A-C-I-C-G-C-C-T-A-T-T-G-T-T-A-Ap 2 3' G-G-C-C-T-A-G-G-C-Cp 

T4 ligase (B) decanucleotide duplex 

(Bam I linker molecule) 



(A) Lac operator duplex 



(C) 5' pC-C-G-G-A-T-C-C-G-G 
3' G-G-C-OT-A-G-G-C-C 



C-C-G-G-A-T-C-C-G-G 3' 
G-G-C-C-T-A-G-G-C-Cp 5" 



Bam I endonuclease 



pHB9 
SNA 



pG-A-l-C-C-G-G 
G-C-C 

Bam I 
endonue lease ^ 



C-C-G 3' 
G-G-C-C-T-A-Gp 5 ' 



linear 

pMB9 DNA 



T 4 polynucleotide ligase 
Hybrid Lac— pMB9 DNA 



5' 
3' 



pA-A-T-T-G-T-G-A-G-C-G-G-A-T-A-A-C-A-A-T-T + 5' G-A-T-C-C-C-C-G-G-G 
T-T-A-A-C-A-C-T-C-G-C-C-T-A-T-T-G-T-T-A-Ap G-G-G-C-C-Cp 



3* 



CA) 



pG-A-T-C -C-C-C-G-G-G 
G-G-G-C-C-C 



<1) T 4 Ligase 
M/ (ii) Kinase 



(B') 



C-C-C-G-G-G 
G-G-G-C-C-C-C-T-A-Gp 



Fig. 22. A general scheme for introducing protruding restriction enzyme sequences 
to the ends of any duplex DNA molecule for cloning. Dashed lines indicate the recogni- 
tion site of a restriction endonuclease (Baml in this example). Long double lines repre- 
sent the 21-nucleotide-long duplex of lac operator DNA. 
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Fig. 22, I, Baml restriction endonuclease) to create the Baml endo- 
nuclease cohesive ends, 5' d(pG-A-T-C). 

Alternatively, a ready-made Baml adaptor (Fig. 22, II) can be used. 
As no digestion with Baml endonuclease is required to create the 
cohesive end, this adaptor can be used for cloning DNA molecules that 
have internal Baml sites. In the final step, this molecule is joined to a 
DNA vehicle (cut by the same endonuclease) to produce a hybrid 
DNA, as described earlier (see Fig. 20). 

Chemically synthesized connector molecules are useful tools in 
cloning since the same type of oligonucleotide connector can serve to 
introduce a variety of double-stranded DNA molecules into cloning 
vehicles at specific sites. Thus far, connector molecules carrying the 
EcoM (14, 16), Baml (15, 16), Hindlll(l5, 16) and Pstl (155) sites have 
been synthesized. 

C. A General Method Involving Adaptors 

1. Description of Method 

A second general method has been developed in our laboratory 
(155). This method uses synthetic adaptors to convert one type of 
cohesive end to another for the cloning of DNA molecules. The prin- 
ciple is illustrated in Fig. 23. In many instances, a DNA molecule to 
be cloned (DNA-X) is most conveniently obtained by cutting a long 
DNA molecule with a specific restriction enzyme, and thus carrying a 
particular cohesive end (e.g., Eco RI sequence). However, the best 
cloning vehicle may be a DNA carrying a different cohesive end, e.g., 
a Baml sequence. Cloning at this site of the pBR313 plasmid has the 
advantage of inactivating the tetracycline-resistant gene (164), thereby 
providing a handle for easy selection of the cloned DNA-X. 

To convert one type of cohesive end to another, we synthesized 



3 t ci 



5' 



5' + 3' 



3' 5' 



DMA (X) 



5' + 3' 



RI 
site 



14 DNA llgase 



DMA(X) 



3' 



3' 
5' 



Bam l 
site 



Bam l 
site 



Fig. 23. A method for converting one type of cohesive end sequence to another by 
the use of double-headed adaptors. 
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Bam I 
site 



5' V 

pC~C-G-G-A-T-C-C-G-G 

G-G-OC-T-A-G-G-C-C 
BamI adaptor 



i. 

pG-C-G-A-A-T-T-C-G-C 
C-G-C-T-T-A-A-G-C-G 



EcoR I adaptor 



T4 ligase 



5' 



C-C-G-G-A-T-C-OG-G-G-OG-A-A-T-T-C-G-C 
G-G-C-C-T-A-G-G-C-C-C-G-C-T-T-A-A-G-C-G 



3' 



5' 



Bam I ^ 



+ EcoRI 
3' 



pG-A-T-C-C-G-G-G-C-C 

G-C-C-C-G-C-T-T-A-Ap 



FlG - 24. A method for synthesizing a conversion adaptor by blunt-end ligation of two 
different types of connectors, followed by cleavage by two restriction endonucleases. 
Alternatively, the ready-made adaptor (lower structure) can be synthesized chemically. 

several double-headed adaptor molecules possessing different cohe- 
sive ends at their termini. In Fig. 23, adaptor molecules with a BamI 
site on one end and an Eco RI site on the other end are joined with 
DNA ligase to DNA-X by matching their EcoRI sites. The resulting 
DNA-X molecule now contains two BamI sites and is ready to be 
joined to a cloning vehicle carrying BamI sites (15), 

We have synthesized several double-headed adaptors using two 
methods. In the first, two different even-ended connectors (such as the 
10-nucleotide-Iong BamI and EcoRI connectors) are joined end-to- 
end by blunt-end ligation as shown in Fig. 24. After purification, the 
desired 20-nucleotide-long double connector is cleaved by both BamI 
and EcoRI endonucleases to create the double-headed adaptor. The 
disadvantage is that only small amounts of the adaptor can be made 
owing to limitation at the DNA ligase step for blunt-end ligation. 

In the second method, the double-headed adaptor (such as that 
shown on the bottom part of Fig. 24) is directly synthesized by chemi- 
cal methods. The double-headed adaptors synthesized by either 
method can be used for the conversion of Eco I sites to BamI sites or 
vice versa. Syntheses of other double-headed adaptors, such as 
Baml-Hinlll and Hinlll-EcoRI, have also been achieved (155). 

2. Conversion of 3'-Protruding Cohesive Ends to 
5'-Protruding Cohesive Ends by the Use of 
Single-Stranded Adaptors 

Several restriction enzymes cut DNA molecules to produce 3'- 
protruding cohesive ends, such as Haell (108) and Pstl (158). Two 
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methods are available for joining a DNA molecule containing 5'- 
protruding ends to one containing 3'-protruding ends. In the first 
method, the 3' protruding ends can be removed by digestion with 
Aspergillus S x nuclease (165), which removes the single-strand ends 
completely (166), The proper connectors can then be added to the 
even-ended duplex DNA using the process described in Section VII, 
B. The disadvantage of using Sj nuclease is that if the DNA-X 
molecule to be cloned contains single-strand breaks, the nuclease may 
produce unwanted double-strand breaks at these points. In the second 
method, a single-stranded decanucleotide such as 5' d(pA-A-T-T-C- 
A-G-C-G-C) can be ligated to the termini, thus converting 3' protrud- 
ing Haell sites on a DNA molecule to 5' protruding Eco RI sites (155). 

D. Synthesis of Connectors and Adaptors of Different Lengths 

After a DNA segment has been cloned, the next step is the tran- 
scription and translation of that segment. In cases where RNA synthe- 
sis is initiated from a promoter site on the vehicle DNA, correct 
translation requires that the reading frame be maintained, i.e., that the 
length of DNA-X plus the connector (or adaptor) be a multiple of 3. 
Since in most cases the exact length of DNA-X is unknown, connectors 
of 3 different lengths that will add n, n + 1 or n - 1 nucleotides (where 
n is a multiple of 3) must be synthesized. For example, ligation of a 
linker molecule B' (Fig. 22) to a DNA will add 6 nucleotides (n + 0). 
For adding n + 1 nucleotides, the hexanucleotide double-stranded 
portion of linker B' can be changed to a heptanucleotide. Connectors 
of this type are being synthesized in our laboratory. 

VIII. Concluding Remarks 

The techniques developed for the synthesis of oligonucleotides of 
defined sequences can be used as primers for DNA sequence analysis. 
They can also be applied to produce modified genes or genetic ele- 
ments. These, in turn, may be used to determine which parts of a 
molecule under consideration are essential to its functioning and 
which are not. For example, the lactose operator may be synthesized 
with altered bases or base-pairs at prescribed locations. The effects of 
these alterations, whether they be an increase, decrease, or no change 
in the binding of lactose repressor, may then be studied. Similarly, 
alterations of one or more base-pairs of a promoter site may be made, 
and the modified promoter tested for interaction with RNA 
polymerase. Once the essential features for promoter-RNA 
polymerase interaction are thoroughly understood, it may be possible 
to design and synthesize a sequence with enhanced promoter activity. 
Such a "super" promoter could then be joined to a gene coding for a 
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desired polypeptide, and that in turn could be joined to a cloning 
vehicle to produce a very efficient system for transcription of this par- 
ticular gene in vivo. Optimal conditions may be found so that large 
quantities of this RNA transcript would then be used to direct the 
synthesis in the microorganism of this protein product in bulk quan- 
tities. The potential for large-scale production of scarce biologically or 
medically useful products could have far-reaching benefits. 
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Summary 

A simple and sensitive radioimmunoassay using 
E. coli /3-galactosidase as a model protein has 
been developed for the detection of specific 
translation products of foreign gene fragments 
cloned into plasmid or phage vectors. This immu- 
noassay is based upon the coupling to an insolu- 
ble matrix of F{ab)' 2 fragments derived from the 
specific antiserum by pepsin digestion. The in 
situ analysis of phage plaques or of bacterial 
colonies is performed by overlaying the phage 
plaques or lysed bacterial colonies with a cellu- 
lose filter to which F(ab) 2 fragments have been 
chemically coupled. The antigen bound to the 
filter is detected by subsequent incubations with 
undigested antiserum and with '"l-fabeled Staph- 
ylococcus aureus protein A followed by autora- 
diography. By coupling the F{ab)' 2 fragments to 
the wells of a plastic microtiter plate, liquid cul- 
tures can be analyzed quantitatively for the pres- 
ence of antigen, making possible the analysis of 
heterogeneous cultures by sib selection. The de- 
tection threshold of the microtiter plate assay for 
liquid culture is shown to be <2 x 10 s molecules, 
or about 1 molecule of /J-galactosidase per cell. 
The in situ immunoassay for bacterial colonies, 
which permits examination of about 1000 clones 
per plate, can easily detect microcolonies pro- 
ducing about 10 molecules of /S-galactosidase 
per cell, while the in situ phage plaque assay, 
also capable of screening about 1000 plaques per 
plate, is even more sensitive, detecting <1 x 10 7 
molecules per bacteriophage plaque. 

Introduction 

The cloning of specific eucaryotic DNA sequences 
in E. coti requires a method for identifying those 
clones that have acquired plasmids or bacterio- 
phages containing the desired DNA fragments. In 
some instances, the biological activity of a gene 
product encoded by the cloned DNA fragment can 
be used for the direct phenotypic selection of 
clones in which the specific DNA sequence is 
expressed as a functional protein (Struhl, Cameron 
and Davis, 1976; Ratzkin and Carbon, 1977; Vap- 
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nek etal., 1977). Such detection systems, however, 
require a homologous bacterial function that can 
be inactivated in the host strain and, consequently, 
are usually limited to those eucaryotic proteins 
which have a bacterial counterpart. 

Identification of bacterial clones that contain 
specific eucaryotic genes inserted into plasmid or 
bacteriophage vectors have also utilized a messen- 
ger RNA (mRNA) or complementary DNA (cDNA) 
probe. Such probes have been used for in situ 
nucleic acid hybridization (Grunstein and Hogness, 
1975; Jones and Murray, 1975) and for filter hybrid- 
ization in a sib selection system (Kedes et al., 
1975). Detection of this kind, however, is depend- 
ent upon the availability of highly purified nucleic 
acid probes. 

Structural analysis of restriction endonuclease- 
generated DNA fragments (Morrow et al., 1974) 
and direct DNA sequence analysis (Ullrich et al., 
1977) have been used to detect and identify eucar- 
yotic DNA segments contained in plasmid mole- 
cules isolated from transformed E. coli cells. Anal- 
ysis of bacterial cell extracts for the appearance of 
new protein bands in polyacrylamide gels has also 
been employed (Chang et al., 1975; Meagher et al., 
1977), but such methods appear to be more suited 
to the analysis of specific clones than to the mass 
screening of bacterial populations. 

Perhaps the most general assay for screening 
bacterial clones for gene products encoded by 
cloned foreign DNA is one based on antibody- 
mediated recognition of antigenic determinants. 
(This approach requires that the determinants to 
be recognized be protein and not carbohydrate in 
nature.) Such an immunoassay would potentially 
be capable of detecting incompletely translated 
products as well as proteins which have no easily 
detectable or selectable function. Furthermore, 
proteins that are unable to function in a bacterial 
cell environment might nevertheless retain their 
immunologically reactive sites. Recently, in situ 
immunoassays have been reported, based on en- 
zyme-coupled antibody detection of protein in A 
plaques (Sanzey et al., 1976) and on immunopre- 
cipitation in top-layer agar for X plaques and bac- 
terial colonies (Skalka and Shapiro, 1976). Such 
assays, however, have been limited in their sensi- 
tivity. In experiments in which the expression of 
cloned eucaryotic DNA in procaryotic cells has 
been demonstrated (Struhl et al., 1976; Ratzkin 
and Carbon, 1977; Vapnek et al., 1977), the evi- 
dence suggests a low level of phenotypic expres- 
sion. Consequently, sensitivity is a primary require- 
ment for an experimentally useful system for de- 
tecting specific clones. 

We report a novel indirect radioimmunoassay 
using E. coli 0-galactosidase as a model gene 
}ned Page #2047 
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product. This assay is dependent upon the cou- 
pling of F(ab)' 2 fragments (bivalent immunoglobu- 
lin G fragments lacking the Fc portion) to an 
insolubte matrix. Such F(ab)' 2 fragments provide a 
means of specifically binding antigen to the solid 
phase, yet are unrecognized by the detecting re- 
agent, 125 l-labeled Staph, aureus protein A, due to 
the absence of the Fc segment. The indirect ra- 
dioimmunoassay therefore does not require purifi- 
cation and labeling of specific antibody. It is highly 
sensitive and quantitative, and can be used for 
screening individual clones or for sib selection 
analysis of heterogeneous cultures. In addition, it 
has been adapted for the in situ analysis of bacte- 
rial colonies and phage plaques, and can thus be 
used for both piasmid and phage vector cloning 
systems. 

Results 

The initial reaction in the multi-step radioimmu- 
noassay is the coupling of F(ab)' 2 fragments de- 
rived from pepsin digestion of rabbit anti-E. co!i 
0-galactosidase to an insoluble matrix. After the 
initial coupling step, the solid phase F(ab)' 2 frag- 
ments are incubated with either soluble j3-galacto- 
sidase or eel! lysates. Next, the undigested antise- 
rum is allowed to interact with the antigen which is 
bound, via the F(ab)' 2 fragment, to the solid phase. 
In the final step, 125 l-labeted protein A (from Staph, 
aureus), which specifically recognizes the immu- 
noglobulin Fc portion (Forsgren and Sjoquist, 
1966), binds to the bound immunoglobulin. After 
each step, unbound reagents are removed by 
washing. The binding sequence is represented 
diagrammatically in Figure 1 , and the dependence 
of final 126 l binding upon each previous binding re- 
action is presented in Table 1 . The wells of a poly- 
vinyl chloride microtiter plate were used as the in- 
soluble matrix for sib selection assays of bacteria 
growing in liquid culture; for the in situ colony or 
phage plaque assay, we used a chemically acti- 
vated cellulose filter. In the microtiter plate assay, 
quantitation is obtained by cutting out the flexible 
wells and measuring the bound 12S ! counts. For 
mass screening, the microtiter plates are analyzed 
by autoradiography. 

Plate Assay 

The sensitivity of the indirect immunoassay can be 



F(ab)' E ANTIGEN ANTIBODY PROTEIN A 




Figure 1 . Schematic Diagram of the Indirect Radioimmunoassay 



estimated with graded amounts of soluble j8-galac- 
tosidase (Figure 2A). The detection threshold is 
below 0.2 ng for the pure protein. The same assay 
monitored by autoradiography is shown in Figure 
2B. In this autoradiogram, the lower limit of detec- 
tion is about 0.5 ng. 

For linear quantitation of antigen, ail reagents 
[F(ab)' 2 , antiserum and protein A ,25 l] must be 
present at saturating levels. The background level 
of nonspecific binding, however, rises slightly as 
the concentration of the detecting reagents, partic- 
ularly antiserum, is increased. The concentration 
of these reagents should therefore be varied ac- 
cording to the estimated amounts of antigen to be 
detected. Thus the signal to noise ratio in the data 
presented in Table 1 can be significantly improved 
by using higher concentrations of antiserum, al- 
though the background radioactivity will rise 
slightly. The antiserum concentration in Table 1 
was optimized for detecting minute amounts of /3- 
galactosidase (Figure 2). Under the conditions of 
the assay shown in Figure 2, antigen is limiting 
only below 1 ng. 

For the detection of jS-galactosidase in bacterial 
cells, cultures of MC 1000 (lac' 7 ) and D7001 (/ac + ) 
were grown to saturation in L broth. For mass 
screening of individual clones, microcultures (300 
jxt) can be grown from single colonies or from 
small inocula for sib selection in the. wells of 
microtiter plates. Bacterial cells were lysed by 
adding 2 drops of chloroform per ml culture and 
SDS to a final concentration of 0.1%. The lysates 
(50 (A) were added to the F(ab)' r coated wells and 
reacted in the multi-step assay. The results from a 
typical experiment are shown in Figure 3. 

The initial rise in bound radioactivity as the lysate 
is diluted (that is, a "prozone" effect) suggests that 



Table 1. Binding of Protein A ,M I as a Function of Previous 
Binding Reactions in a Microtiter Plate 






Bound 12S I cpm 
(Protein A) 


F(ab)' 2 Antigen 


Antiserum 


SE 


+ ■ + 


+ 


12,059 ± 139 


+ + 




268 ± 13 


+ - 


+ 


1,109 ± 40 


- + 


+ 


274 i 8 



The wells of a polyvinyl chloride microtiter plate were coated with 
F(ab)' 2 fragments (75 ixi of 0.025 mg/ml) or 1% BSA (50 fil). 
Samples (50 ftl) of /3-galactosidase solution (1 /ig/rnl) or of 1% 
BSA (50 (lI) were incubated in the coated wells, followed by an 
incubation with 50 fcl of anti- /3-galactosidase serum (diluted 1/ 
2000) or of normal rabbit serum (diluted 1/30O0). Finally, 50 (J of 
,a l-labeled protein A were added to the wells, and the plate was 
treated as described in Experimental Procedures. All samples 
were tested in quadruplicate. 
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Figure 2. Binding Curve for 0-Galactosidase 

The wells of a polyvinyl chloride microtiter plate were coated with F(ab)'j fragments as described in Experimental Procedures, fi- 
Galactosidase samples (50 /if) were added to the wells, followed by incubations with antiserum and 1K l-labeled protein A (Staph, aureus) 
as detailed in Experimental Procedures. The amount of 0-galactOsidase per well was determined by making serial dilutions from a stock 
(1 mg/ml) solution of purified £-galaetosidase (Sigma). Samples were tested in quadruplicate; standard errors are represented by bars. (B) 
represents an autoradiogram of the same titration curve for 0-galactosidase antigen. The microtiter plate was placed over a Kodak XR-5 
film and intensifying screen for 24 hr. 



some inhibition of the antibody-antigen binding 
reaction occurs in the presence of detergent. (The 
degree of inhibition was variable, depending upon 
how much of the detergent was transferred from 
the Sysate into the assay weil.) Inhibition by deter- 
gent is not a complete explanation for the "pro- 
zone" effect, however, since a small rise in the 
binding curve is sometimes observed in sonicated 
extracts as well. Reactions were run with SDS 
concentrations from 0.5-0.025%, indicating an op- 
timal level of around 0.1%, since the efficiency of 
cell lysis was reduced at lower levels. The /3-galac- 
tosidase levefs in the lysates from induced lac + 
cultures are not limiting until the lysate has been 
substantially diluted. In Figure 3A, the undiluted 
lac + and the /ac v cultures can be distinguished; the 
binding of ,25 l-labeled protein A molecules to the 
lac v extract, however, is significantly above the 
background level. 



The low level binding to protein observed in a 
lac^ extract could be due either to contaminating 
antibody in the rabbit antiserum directed against 
other bacterial antigenic determinants or to a 
cross-reaction of the anti-j3-galactosidase with 
other immunologically related bacteria) proteins. 
To distinguish between these possibilities, a /ac v 
extract was prepared by sonication and coupled to 
CnBr-activated Sepharose. The antiserum was 
passed over the lac v extract affinity column, and 
the excluded volume fraction was tested (Figure 
3B). The binding to lac"? fysate was significantly 
reduced, suggesting the presence of some con- 
taminating antibodies in the original antiserum. All 
subsequent assays were performed with the ab- 
sorbed /3-galactosidase antiserum. 

The sensitivity of the immunoassay is such that 
even minor contaminants are detected; they would 
not be expected to present a problem, however, in 
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Figure 3. Comparison of lac* and lac v Lysates in the Indirect Radioimmunoassay 

Lysates (50 jd) from MC1000 [tec'(A)l and D7001 tjac*) cultured in the absence (O) or presence (O) of 10" 3 M IPTG were added to the 
Ffabj'rcoated wells of a microtiter plated After a 2 hr incubation, 50 pi of unabsorbed (A) or absorbed (B) antiserum (a 1/1000 dilution in 
1% BSA) were added to the well. 2 hr later, protein A- 12S I (50 pi) was incubated in the well (100,000 cpm per well) overnight at 4°C. All 
samples were tested in duplicate. 



identifying gene products encoded by foreign DNA. 
In homologous cloning systems, antiserum can be 
purified by absorption with extract from the appro- 
priate deletion strain. In heterologous cloning sys- 
tems, if antiserum directed against a eucaryotic 
protein has a cross-reaction with a bacterial pro- 
tein, the cross-reactive antibody can be absorbed 
similarly with bacterial extract coupled to Sepha- 
rose. 

The titration curve for radioactivity bound to the 
induced lac + lysate indicates that about 500 times 
as much immunologically reactive material is pres- 
ent as in the uninduced lac + lysate. These same 
lysates, assayed by the conventional colorimetric 
enzyme assay, show an approximately 300 fold 
induction ratio (Table 2). Thus the quantitation of 
antigen by the indirect radioimmunoassay for 0- 
galactosidase is in general agreement with meas- 
urements of enzyme activity. 

To estimate the number of cells producing gene 
product required for detection, lac + and /ac v cells 
were mixed in varying proportions and lysed, and 
the extracts were tested in the indirect radioimmu- 
noassay (Figure 4). Based on this reconstruction 
experiment, the limit of detection is on the order of 
1 /ac+ eel! in 10 3 /ac v cells, or about 5 x 10 4 j3- 
galactosidase-producing cells per well. 

^-Galactosidase levels in the EK2 bacterial host, 
x1776, have also been examined in this assay. In L 
broth cultures supplemented with thymidine and 
diaminopimmelic acid (DAP), the same SDS con- 
centration is required to achieve efficient cell lysis. 

In Situ Analysis (Phage Plaques) 

For the in situ analysis of /3-galactosidase in bac- 
teriophage plaques, we have adapted the immu- 



Table 2. 0-Galactosida$e Enzyme Activity in Saturated Cultures 
Enzyme Units per ml Culture 

(lac^) MC1000 0.98 
{lac*) D7001 (Uninduced) 14.05 
[lac*) D7001 (Induced) 3939.00 

Saturated cultures of MC100O (fac* 7 ), uninduced D7001 (lac*) and 
D700 (/ac*), induced by growth in the presence of 10~ 3 M IPTG, 
were lysed with SDS and chloroform, and assayed for /3~ga!aeto- 
sidase activity as described in Experimental Procedures. 




io-i I0" : 

Figure 4. Reconstruction Mixture of lac* and /ac v Cells 
Saturated L broth cultures of MC1000 (/aO and D7001 (/ac*) 
were mixed in various proportions, and lysed with chloroform and 
SDS as described in Experimental Procedures. The lysates were 
diluted Va in 1% BSA, and 50 fil were added to the F(ab)'2-coated 
wells of a microtiter plate. After a 2 hr incubation, 50 /i\ of 
absorbed antiserum (diluted 1/1500 in 1% BSA) were added to 
the well. 50 /J of protein A- ,2S I (about 1 00,000 cpm per well) were 
incubated in the well overnight at 4°C, and the plate was analyzed 
as described in Experimental Procedures. All samples were tested 
in quadruplicate; standard errors are represented by bars. 
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noassay such that the F(ab)' 2 fragments are cou- 
pled to a chemically reactive cellulose filter, acti- 
vated by the method of Alwine, Kemp and Stark 
(1977). Once the F(ab)' 2 fragments have been cou- 
pled to the circular cellulose filter and the reactive 
sites inactivated by treatment with 1 M glycine and 
0.5% BSA (bovine serum albumin), the filter is 
applied directly to the phage plate for 1 hr. The 
filter is then incubated with antiserum for 3 hr and 
subsequently with 125 l-labeled protein A for 3 hr, 
and finally washed and overlaid with x-ray film for 
autoradiography. The results for a mixture of A. 
CI857 p5/ac V (att)SR2 and X C1857 plated with 
MC1000 (lac?) cultures on YT plates (Miller, 1972) 
containing the /3-galactosidase stain X-Gal (5- 
bromo-4-chloro-3-indolyl-)3-D-galactoside) are 
shown in Figure 5. All plaques, identified on the 
phage plate by the X-Gal stain, are clearly detected 
as discrete dark spots on the x-ray film. To estimate 
the sensitivity of the in situ plaque assays, X phage 
bearing a lac gene, with a low efficiency promoter, 
X CI857 p5(/ac p _ 162), was plated alone and mixed 
with \ CI857 on MC1000 (Figure 6). Here, small 
discrete spots on the x-ray film identify \ CI857 p5 
(lac p~162) plaques, although the /3-galactosidase 
activity of these plaques was so low that they went 
undetected by the X-Gal stain in the top layer agar. 
The plaques, identified as Xp5{/ac p"162) by auto- 
radiography, were examined by picking and testing 
them individually in the colorimetric enzyme assay 
with ONPG as substrate. Since the activity in the 
plaques was so low, the putative Xp5(/ac p~162) 



A B 




Figure 5. fn Situ Radioimmunoassay for /3-Galactosidase in X 
CI857 p5 lac v(att)SR2 Plaques 

0.1 ml of a L broth dilution of A CI857 p5 lac V(att)SR2 mixed with 
0.1 ml of a dilution of A CJ857 was plated withG.1 m! of a saturated 
MC1Q00 IJac") culture in 2 ml H top agar containing 1 mg X-Gal 
on YT plates. The plates were incubated at 37°C for S hr and then 
overlaid with a F(ab)'z-coated cellulose filter for 1 hr. The filter 
was incubated with antiserum (1/1000) for 3 hr and then with a 
IM l-labeled protein A solution for3 hr. The washed and dried filter 
was overlaid with Kodak NS-T2 X-ray film for 6 hr. (A) YT plate 
with X-Gal stain in top layer agar. The dotted outline indicates the 
area overlaid by the cellulose filter. (B) Autoradiogram of the 
treated filter. 



plaques were also verified genetically by testing 
the ability of the phage to recombine with a strain 
carrying the small internal deletion, s(/ac z) m15 
(Langley et al., 1975) in the /3-galactosidase struc- 
tural gene. Recombinant (lac + ) colonies were 
scored on MacConkey's lactose plates. The /3-ga- 
lactosidase activity in Xp5(/ac p 162) plaques mea- 
sured by colorimetric enzyme assay 5 hr after 
plating was slightly higher than the background 
level of X CI857 plaques and only about 0.05% of 
the activity in X p5 lac plaques (Table 3). 

In Situ Immunoassay for Bacterial Colonies 

A mixture of a saturated MC1000 (/ac) culture and 
a D7001 (lac + ) culture were plated on YT plates 
with or without IPTG and grown up to microcolo- 
nies (about 0.5 mm in diameter) at about 1000 per 

A ^^B^ ^ C 

Figure 6. In Situ Radioimmunoassay for /3-Ga!actosidase in A 
CI857 p5 (lac p 162) Plaques 

0.1 ml of an L broth dilution of A C1857 p5 (lac p*162) was plated 
with 0.1 ml of an MC1000 (jac^) culture on YT plates containing 1 
mg X-Gal in the top layer agar and overlaid with a F(ab)Vcoated 
cellulose filter. The filter was treated with antiserum and ,25 l- 
labeled protein A, and analyzed by autoradiography as described 
in Figure 5. The X-ray film was exposed for 18 hr. A portion of the 
autoradiogram is shown in (A). 

A mixture of A CI857 p5 (/ac p"162) and A CI857 was plated (in 
a ratio of 1 :1) with MC1000 (lac?) on YT plates containing X-Gal in 
the top layer and overlaid with a F(ab)Vc'oated filter. The treated 
filter was analyzed by autoradiography. The X-ray film was ex- 
posed for 36 hr. A portion of the phage plate with a mixture of A 
CI857 and A CI857 p5 {lac p l62) plaques is shown in (B); the 
corresponding portion of the autoradiogram is shown in (C). The 
genotype of A phage in individual plaques was verified by testing 
for the ability to recombine with an E. coli strain carrying a small 
internal deletion in the/ac z gene. 



Table 3. Measurement of /3-Galactosidase Enzyme Activity in 
Bacteriophage A Plaques 

Enzyme Units per Plaque 



A Stock SE 

CI857 0.0004 ± 0.0004 

CI857p5/ac'(att)SR2 5.48 £0.63 

CI857 p5 (lac p"162) 0.002 ± 0.0004 

Individual plaques were picked 5 hr after plating and suspended 
in 0.5 ml "z" buffer (Miller, 1972). ONPG (50 M l at 4 mg/ml) was 
added, and the reaction was terminated by the addition of 100 
1 M NajCOs. Absorbance at 420 and 550 was measured. Average 
values were calculated from measurements o'f five individual 
plaques. Values calculated from measurements of five pooled 
plaques gave similar results. 
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plate. The colonies were lysed by exposing the 
plates to chloroform vapor and then overlaying 
them with a thin {1 .5 ml) agar top layer containing 
lysozyme and SDS (0.025%). The F(ab)' 2 fragment- 
coated filter was applied directly to the surface of 
the plate for 3 hr, treated with antiserum and 125 l- 
labeled protein A as described above, and analyzed 
by autoradiography. The results are shown in Fig- 
ure 7. The induced lac* colonies (that is, those 
grown on the IPTG-containing plates) are identified 
by large dark spots on the x-ray film. Uninduced 
lac* clones, which show up as very small, discrete 
spots on the x-ray film, are also clearly distinguish- 
able from the /ac 7 colonies. A sufficient number of 
cells remained viable in these microcolonies, even 
after treatment with chloroform vapor and the ly- 
sozyme-SDS top layer agar, so that these colonies, 
identified as lac* by autoradiography, could be 
tested and were confirmed to be lac* by steaks on 
MacConkey's agar plates. Since viable cells can be 
recovered from the colonies embedded in the ly- 
sozyme-SDS top layer, replica plating is not nec- 
essary for this in situ immunoassay. 

Discussion 

The most general method for detecting gene trans- 
lation products in bacterial cells is an immunoas- 
say, since it requires neither an intact nor a func- 
tional protein. The indirect radioimmunoassay de- 
scribed here for jS-galactostdase as a model pro- 
tein has a number of useful properties. Unlike 
direct radioimmunoassays, it does not require pu- 
rification and labeling of specific antibody. The 
labeled detecting reagent, protein A, can be used 
in conjunction with different antisera for screening 
clones for a variety of gene products. The assay is 
simple, yet sensitive, reproducible and quantita- 
tive, and is capable of reliably detecting below 0.2 



ng of /S-galactosidase, or 2 x 10 s molecules. 

In addition, the indirect radioimmunoassay can 
be used for the in situ analysis of bacterial colonies 
and phage plaques, and by coupling the F(ab)' 2 
fragments to the wells of a microtiter plate, individ- 
ual clones in microcultures can be assayed quanti- 
tatively or heterogeneous cultures can be analyzed 
by sib selection. For mass screening, the transfer 
of microculture lysates in a microtiter plate to the 
F(ab)Vcoated wells of another microtiter plate can 
be streamlined by using a 96 channel automatic 
pipetter (Cooke) and the plates monitored by auto- 
radiography. 

Uninduced /ac + cultures, estimated to contain 
10-20 jS-galactosidase molecules per cell (Miller, 
1972) are detected easily and reliably when the 
lysatefrom 5 x 10 7 cells is assayed. When cultured 
in the presence of IPTG, 1 /ac + in 10 3 /ac v cells, or 
as few as 5 x 10" lac* cells, can be identified. 

In our measurements for saturated L broth cul- 
tures, induced lac* cultures have about 300 times 
the enzyme activity of the uninduced lysates. As- 
suming, then, that induced lac* cells contain about 
3-6 x 10 3 molecules per cell, the indirect radioim- 
munoassay is capable of detecting 1.5-3.0 x 10 s 
/3-galactosidase molecules in cell lysates. This fig- 
ure is in good agreement with the estimate of 
sensitivity based on the detection of )S-galactosid- 
ase in solution in Figure 2. 

The chloroform-SDS lysis method was developed 
to optimize the rapid mass screening of microcul- 
tures. If the number of clones is small, however, 
concentrated, sonicated extracts of bacterial cells 
can be used, thus avoiding the problem of deter- 
gent inhibition and allowing the extract from more 
than 10 9 cells to be measured in a single well. 
Under these conditions, the sensitivity of the assay 
should be enhanced by at least an order of magni- 
tude, permitting the detection of clones producing 
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Figure 7. In Situ Radioimmunoassay for j3-Galactosidase in Induced and in Uninduced Colonies of D7001 (lac*) 

A mixture of MC1000 (iac?) and D7001 (/ac*) cultures was plated on YT plates containing IPTG (A) or without IPTG (B) and grown up to 
microcolonies (about O.S mm in diameter) at about 1000 colonies per plate. The plates were exposed to chloroform vapors for 5 min and 
then overlaid with 1.5 ml of H top agar containing lysozyme (0.25 mg) and SDS (0.025%). The F(ab) 1 ! fragment-coated filter was applied 
directly to the plate surface for 2 hr and treated with antiserum and 1M l-labeled protein A as described in Figure 5. The filters were placed 
on Kodak NS-T2 for autoradiography. (A) represents an X-ray film exposed for 13 hr; the film shown in (B) was exposed for 24 hr. 
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as little as one molecule per celt or the identifica- 
tion of 1 lac + in 10 4 /ac 7 cells. 

By coupling the F(ab)' 2 fragments to a chemically 
activated cellulose filter, the indirect radioimmu- 
noassay can be adapted, using autoradiography, 
to the in situ analysis of bacterial colonies and of 
phage plaques. Thus molecular cloning systems 
utilizing either plasmid or phage vectors can be 
analyzed with efficiency (as many as 10 3 clones per 
plate) and sensitivity. In the in situ analysis of 
plaques of A. p5 lac, dark spots on the x-ray film 
showed a one-to-one correspondence with plaques 
stained by X-Gal on the plate. The size of the x-ray 
film spot was proportional to the size of the enzy- 
matic stain surrounding the phage plaque. 

Small plaques, which had barely detectable 
halos of X-Gal stain and were therefore presumably 
producing low levels of /3-galactosidase, were eas- 
ily identified as small discrete spots on the x-ray 
film. Thus the radioimmunoassay with autoradi- 
ographic screening is more sensitive than the X- 
Gal enzymatic stain for identification of /3-galacto- 
sidase-containing phage plaques. The superior 
sensitivity of the immunoassay is also indicated by 
the ability to detect specific film darkening in the 
absence of any detectable X-Gal staining for 
plaques of A p5 (lac p~162), a phage carrying a lac 
allele with a low efficiency promotor mutation. 
Plaques of A p5 lac (0.2-1.0 mm in diameter) 
measured 5 hr after plating contained about 5.5 
units of ^-galactosidase activity, an average value 
based upon the measurement of individual 
plaques. In calculations based on the estimate of 
300,000 units per mg protein (Miller, 1972) {1 unit 
= 3.6 x 10 9 molecules), A p5 lac plaques contain 
about 2.0 x 1Q"> molecules. The /3-galactosidase 
activity of individually tested Ap5 (lac p _ 162) 
plaques is <0.05% of the activity in Ap5 lac 
plaques. Thus the lower limit of sensitivity for the 
in situ A plaque immunoassay is below 1.0 x 10 7 
molecules. 

The in situ analysis of bacterial colonies utilizes 
the same F(ab)'2-coated cellulose filter and is, in 
addition, dependent upon the lysis of colonies that 
are embedded in a top layer containing SDS and 
tysozyme. Large discrete spots on the x-ray film are 
observed corresponding to induced lac + colonies 
on plates containing IPTG. (n the absence of IPTG, 
lac* colonies (uninduced) are revealed as very 
small , yet easily discernible spots on the x-ray film . 
Thus microcolonies (0.5-1.0 mm in diameter) con- 
taining only 10-20 molecules of /3-galactosidase 
per cell are easily identified. 

Unlike earlier detection methods based on im- 
munoprecipitation, the sensitivity of the in situ 
indirect immunoassay is independent of the serum 
titer or affinity. The chemical coupling of F(ab)' 2 
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fragments to an insoluble matrix provides for a 
localized high concentration of detecting reagents 
and is therefore not dependent upon the diffusion 
of antibodies and antigen in an agar gel. Accord- 
ingly, the amount of antisera required is less than 
is needed for assays where detection occurs in the 
agar plate. In addition, since the array of colonies 
of plaques on a plate is only in temporary contact 
with the detecting antibody , the same set of clones 
can be screened with a panel of specific F(ab)V 
coated cellulose filters to monitor gene expression 
for a number of different proteins. This may prove 
useful for cloning systems in which gene families 
(for example, the histones) are being analyzed, and 
in shot-gun cloning systems where more than one 
gene is under investigation. 

Si nee about 1000 individual colonies or plaques 
can be monitored per plate, the in situ radioimmu- 
noassay represents a very powerful and sensitive 
general method for screening clones for the 
expression of foreign gene fragments cloned into 
phage or piasmid vectors. In addition, this immu- 
noassay provides a method for examining the con- 
ditions that would allow a cloned eucaryotic DNA 
sequence, previously identified by nucleic acid 
hybridization probes, to be expressed at the pro- 
tein level. 

Experimental Procedures 
lodination ol Protein A 

Protein A (purchased from Pharmacia) was labeled with 12S ! using 
chloramine T in a modification of the original method of Hunter 
and Greenwood (1962). 20 y.\ of a 1 mg/mi protein A solution in 
0.5 M phosphate buffer (pH 7.0) were mixed with 20 ^1 of 
neutrafized Ma ,M 1 (purchased from New England Njclear) and 10 
jil of a 1 mg/ml solution of chloramine T in 0.5 M phosphate 
buffer. After 1 min, the reaction was stopped by adding 50 p.\ of 1 
mg/ml tyrosine in 0.5 M phosphate buffer, and 100 of 1% SSA 
were added as carrier protein. The mixture was applied immedi- 
ately to a 10 ml column of Sephadex G-25 and eluted with 1% 
BSA. The peak fractions were pooled and stored at 4°C in the 
presence of 0.5% sodium azide. A fresh batch of reagent was 
labeled every 6-8 weeks. The specific activity of the labeled 
reagent, assuming no protein loss, varied between 15-30 /iCi//ig. 

Preparation of F(ab)' 2 Fragment of Rabbit Antf-p-Galactosidase 

The F(ab)'! fragment was derived from rabbit anti-E. coli p- 
galactosidase serum 484 according to a modification of the 
method of Madsen and Rodkey (1976). 1 ml of serum was 
incubated with an equal volume of saturated ammonium sulfate. 
The pellet resulting from two precipitation steps with 50% ammo- 
nium sulfate was resuspended in 1 ml 0.1 M sodium acetate (pH 
7.0) and dialyzed overnight against the acetate buffer (pH 7.0). 
The protein solution was adjusted to a pH of 4.3 with 1 M HC! and 
incubated with 100 jil of pepsin (Worthington Biochemical Co.) at 
4 mg/ml in 0.1 M acetate buffer (pH 4.3) for 8 hr at 37°C. After 
dialysis against PBS [0.15 M phosphate-buffered saline (pH 7.4)], 
the protein solution was applied to a Sephadex G-100 (Pharmacia) 
column and then passed over a 3 ml affinity column of protein A 
coupled to Crt-Br-activated Sepharose 4B to remove any residuat 
immunoglobulin molecules bearing Fc determinants. Passage 
over the protein A-Sepharose column alone proved sufficient for 
obtaining clean preparations of F(ab)'2 fragments. 
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Conditions for the Microliter Plate Indirect Radioimmunoassay 

The 0-galactosidase immunoassay is a modification of a plate 
assay (Klinman et at., 1976) designed to measure serum antibody. 
75 ,il of the F(ab)'i solution (0.025 mg/ml in PBS) are incubated 
in the wells of a flat-bottomed, disposable, polyvinyl chloride 
microtiter plate (Cooke Laboratory Products) for 2 hr at room 
temperature. 50 jil of antigen solution, either pure protein or cell 
lysate. are then incubated in the F(ab)Vcoated wells for 2 hr at 
room temperature. The anti-£-galaetosidase serum, at a dilution 
of 1/500-1/2000 is added to the wells for 2 hr at room tempera- 
ture. 50 fi\ of a protein A ,M 1 solution, diluted to a final concentra- 
tion of 0.1-0.3 ji,g/ml and to a final activity of about 100,000- 
200,000 cpm per well, are incubated overnight at 4°C. Unbound 
reagents are removed by washing with PBS-FCS (phosphate- 
buffered saline with 5% fetal calf serum). All dilutions are made in 
1% BSA (bovine serum albumin). ,M l-labeled protein A molecules 
bound to the wells are measured by cutting and counting the 
wells in a Searle gamma counter or by autoradiography, 

Autoradiography 

The microtiter plates were placed over Kodak XR-5 film and a 
Kodak X-Omatic Regular Intensifying Screen, and the film was 
developed after exposure for 20-40 hr. The treated cellulose 
filters were placed over Kodak NS-T2 film, and the film was 
developed after 6-18 hr of exposure. 

Preparation and Usage of fac 7 Extract-Sepharose Affinity Column 

Sonicated extract prepared from 20 ml of a saturated MC1000 
(/ac v ) culture was dialyzed overnight against coupling buffer T0.1 
M NajCQj, 0.3 M NaCI (pH 8.5)] and incubated overnight at 4°C 
with 1 .3 gm (about 5 ml) of activated Cn-Br-Sepharose-4B (Phar- 
macia). The gel was washed with 1 M ethanolamine (pH 9.0) and 
then, in alternating cycles, with 0.1 M acetate buffer (pH 4.0) and 
0.1 M borate buffer (pH 8.0) according to the manufacturer s 
instructions. 

2.0 ml of 484 antiserum (diluted 1/100 in 1% BSA) were passed 
over a 3 mi column followed by 6 ml of 1% BSA. The absorbed 
serum was therefore diluted 1/400. The column was washed with 
1 M propionic acid and stored in PBS in the presence of 0.1% 
sodium azide. 

E. coli Strains 

Strains MC 100 (!ac v ) and D7001 Vac*), obtained from Dr. Mal- 
colm Casadaban (Stanford University), were cultured in L broth. 
j3-Galactosidase was induced by the addition of 10 1 M isopropyl- 
p-thiogalactoside (IPTG; purchased from Sigma). X1776, ob- 
tained from Dr. Roy Curtiss III (University of Alabama), was grown 
in L broth, supplemented with 40 ^g/ml thymidine and 100 ^g/ml 
diaminopimmelic acid. Cultures were lysed by the addition of 
chloroform (2 drops per ml) and SDS to a final concentration of 
0.1% followed by vortexing for 1 min. Enzyme assays for 0- 
galactosidase activity in the chloroform-SDS lysates were per- 
formed using ONPG (O-nitrophenyl-j3-0-galactoside) as sub- 
strate according to Miller (1972). 

A Stocks 

Stocks XC/857p5 lac ^att)SR2, \CI857 pS(/ac p"162)and WI857, 
obtained from Dr. Malcolm Casadaban, were diluted in L broth 
arid plated with 0.1 ml of a saturated L broth culture of MC 1000 
(Sac*) in 2 ml of H top layer agar on YT plates (Miller, 1972). The 
plates were incubated at 37°C. To stain the /3-galactosidase- 
producing plaques, 1 mg of X-Gal (5-brom-5-chloro-3-indoyl-(3- 
D-galactoside) was added to the top layer (2 ml). 

Binding F(ab)' 2 Fragments to a Cellulose Filter 

Whatman 540 cellulose paper was converted to ABM (aminoben- 
iyloxy methyl) paper with treatment by i-[(m-nitrober>zyloxy) 
methyl] pyridinium chloride according to the method of Alwine et 
al. (1977). Circles were cut from the ABM paper and converted to 
the reactive diazobenzyloxy methyl (DBM) cellulose paper by 



treatment with HCI and NaNOs (Alwine et al., 1977). The activated 
filters were incubated with 2 ml of anti-/3-galactosidase F(ab)'j 
fragment (0.15 mg/ml) in Borate buffer [50 mM (pH 8)] for 18 hr at 
4°C. To inactivate the reactive sites, the filters were then incu- 
bated with 5 ml of 0.1 M glycine and 0.5% BSA in PBS for 5 hr at 
37°C. 

Conditions for the in Situ Phage Plaque Assay 

The F(ab)Vcoated cellulose filter was applied directly to the 
phage plate about 5 hr after pouring the top layer. After a 1 hr 
incubation at 37°C on the plate, the filter was removed, washed 
with PBS-FCS and incubated with 3 ml of anti-0-galactosidase 
serum (diluted 1/1000 in 1% BSA) for 3 hr, The filter was then 
washed, incubated with 3 ml of '"l-labeled protein A (0.1-0.4 mg/ 
ml; 10 s cpm/ml) for 3 hr, washed, dried and placed on Kodak NS- 
T2 x-ray film for autoradiography. 

Conditions for the in Situ Bacterial Colony Assay 

A mixture of MC1000 (lac*) and D7001 (/ac+) cultures were plated 
on YT plates, with and without IPTG (2 x 10" 3 M), and allowed to 
grow up to microcolonies (about 0.5 mm in diameter) at about 
1000 colonies per plate. The colonies were lysed by exposing 
them to chloroform vapors for 5 min and then overlaying the plate 
with 1 .5 ml of H top agar containing lysozyme (0.25 mg) and SDS 
(0.025%). The Fiab)' 2 fragment-coated filter was applied directly 
to the surface of the top layer for 2-3 hr, removed and washed 
with PBS-FCS. The filter was then treated with anti-£-galactosid- 
ase serum and ,25 l-labeled protein A, and analyzed by autoradi- 
ography as described above. 
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1 1 26. A DNA according to claim 25, comprising the sequence: 
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20 


ATT 


CTG 


CAT 


TAC 


CTG 


AAG 


GCC 


AAG 


GAG 


TAC 


AGT 


CAC 


TGT 


GCC 


TGG 


ACC 


21 


ATA 


GTC 


AGA 


GTG 


GAA 


ATC 


CTA 


AGG 


AAC 


TTT 


TAC 


TTC 


ATT 


AAC 


AGA 


CTT 


22 


ACA 


GGT 


TAC 


CTC 


CGA 


AAC, 























23 



24 27. A replicable microbial expression vehicle capable, in a transformant 

25 microorganism, of directing expression of a DNA of claim 25 or 26. 

26 

27 28. A microorganism transformed with a microbial expression vehicle capable, in 

28 said microorganism, of directing expression of a DNA of claim 25 or 26. 
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1 

2 29. The microorganism of claim 28 wherein said microorganism is Escherichia coli. 

3 

4 30. The transformed microorganism according to claim 29 wherein said 

5 microorganism is Escherichia coli K-12 strain 294. 

6 

7 3 1 . A method for producing a first polypeptide having a total of 166 amino acids and 

8 having the amino acid sequence of a mature human fibroblast interferon or a second polypeptide 

9 having a total of 165 amino acids and having the amino acid sequence of a mature human 

10 fibroblast interferon except for the amino-terminal methionine of said interferon or a mixture of 

1 1 said first and second polypeptides, comprising culturing a microorganism transformed with a 

12 microbial expression vehicle capable, in said microorganism, of directing expression of a DNA 

13 of claim 25 or 26. 

14 

15 32. A method for producing a polypeptide having a total of 166 amino acids and 

16 having the amino acid sequence of a mature human fibroblast interferon, comprising culturing a 

17 microorganism transformed with a microbial expression vehicle capable, in said microorganism, 

18 of directing expression of a DNA of claim 25 or 26. 

19 

20 33. A method for producing a polypeptide having a total of 165 amino acids and 

21 having the amino acid sequence of a mature human fibroblast interferon except for the amino- 

22 terminal methionine of said interferon, comprising culturing a microorganism transformed with a 

23 microbial expression vehicle capable, in said microorganism, of directing expression of a DNA 

24 of claim 25 or 26. 

25 

26 34. A DNA which consists essentially of a DNA which codes for human fibroblast 

27 interferon polypeptide. 

28 
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1 35. A DNA consisting essentially of a DNA which codes for mature human fibroblast 

2 interferon polypeptide having the amino acid sequence: 



3 


Met 


Ser 


Tyr 


Asn 


Leu 


Leu 


Gly 


Phe 


Leu 


Gin 


Arg 


Ser 


Ser 


Asn 


Phe 


Gin 


4 


Cys 


Gin 


Lys 


Leu 


Leu 


Trp 


Gin 


Leu 


Asn 


Gly 


Arg 


Leu 


Glu 


Tyr 


Cys 


Leu 


5 


Lys 


Asp 


Arg 


Met 


Asn 


Phe 


Asp 


He 


Pro 


Glu 


Glu 


He 


Lys 


Gin 


Leu 


Gin 


6 


Gin 


Phe 


Gin 


Lys 


Glu 


Asp 


Ala 


Ala 


Leu 


Thr 


He 


Tyr 


Glu 


Met 


Leu 


Gin 


7 


Asn 


He 


Phe 


Ala 


He 


Phe 


Arg 


Gin 


Asp 


Ser 


Ser 


Ser 


Thr 


Gly 


Trp 


Asn 


8 


Glu 


Thr 


He 


Val 


Glu 


Asn 


Leu 


Leu 


Ala 


Asn 


Val 


Tyr 


His 


Gin 


He 


Asn 


9 


His 


Leu 


Lys 


Thr 


Val 


Leu 


Glu 


Glu 


Lys 


Leu 


Glu 


Lys 


Glu 


Asp 


Phe 


Thr 


10 


Arg 


Gly 


Lys 


Leu 


Met 


Ser 


Ser 


Leu 


His 


Leu 


Lys 


Arg 


Tyr 


Tyr 


Gly 


Arg 


11 


He 


Leu 


His 


Tyr 


Leu 


Lys 


Ala 


Lys 


Glu 


Tyr 


Ser 


His 


Cys 


Ala 


Trp 


Thr 


12 


He 


Val 


Arg 


Val 


Glu 


He 


Leu 


Arg 


Asn 


Phe 


Tyr 


Phe 


He 


Asn 


Arg 


Leu 


13 


Thr 


Gly 


Tyr 


Leu 


Arg 


Asn , 























14 



15 36. A DNA consisting essentially of a DNA according to claim 35 which has the base 

16 pair sequence: 

17 ATG AGC TAC AAC TTG CTT GGA TTC CTA 

18 TAC TCG ATG TTG AAC GAA CCT AAG GAT 
19 

20 CAG TGT CAG AAG CTC CTG TGG CAA TTG 

21 GTC ACA GTC TTC GAG GAC ACC GTT AAC 
22 

23 TGC CTC AAG GAC AGG ATG AAC TTT GAC 

24 ACG GAG TTC CTG TCC TAC TTG AAA CTG 
25 

26 CAG CTG CAG CAG TTC CAG AAG GAG GAC 

27 GTC GAC GTC GTC AAG GTC TTC CTC CTG 
28 

29 GAG ATG CTC CAG AAC ATC TTT GCT ATT 

30 CTC TAC GAG GTC TTG TAG AAA CGA TAA 
31 

32 AGC ACT GGC TGG AAT GAG ACT ATT GTT 

33 TCG TGA CCG ACC TTA CTC TGA TAA CAA 
34 
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CAA AGA 
GTT TCT 

AAT GGG 
TTA CCC 

ATC CCT 
TAG GGA 

GCC GCA 
CGG CGT 

TTC AGA 
AAG TCT 

GAG AAC 
CTC TTG 



AGC AGC 
TCG TCG 

AGG CTT 
TCC GAA 

GAG GAG 
CTC CTC 

TTG ACC 
AAC TGG 

CAA GAT 
GTT CTA 

CTC CTG 
GAG GAC 



AAT TTT 
TTA AAA 

GAA TAT 
CTT ATA 

ATT AAG 
TAA TTC 

ATC TAT 
TAG ATA 

TCA TCT 
AGT AGA 

GCT AAT 
CGA TTA 



1 GTC TAT CAT CAG ATA AAC CAT CTG AAG ACA GTC CTG GAA GAA AAA 

2 CAG ATA GTA GTC TAT TTG GTA GAC TTC TGT CAG GAC CTT CTT TTT 
3 

4 CTG GAG AAA GAA GAT TTT ACC AGG GGA AAA CTC ATG AGC AGT CTG 

5 GAC CTC TTT CTT CTA AAA TGG TCC CCT TTT GAG TAC TCG TCA GAC 
6 

7 CAC CTG AAA AGA TAT TAT GGG AGG ATT CTG CAT TAC CTG AAG GCC 

8 GTG GAC TTT TCT ATA ATA CCC TCC TAA GAC GTA ATG GAC TTC CGG 
9 

10 AAG GAG TAC AGT CAC TGT GCC TGG ACC ATA GTC AGA GTG GAA ATC 

11 TTC CTC ATG TCA GTG ACA CGG ACC TGG TAT CAG TCT CAC CTT TAG 
12 

13 CTA AGG AAC TTT TAC TTC ATT AAC AGA CTT ACA GGT TAC CTC CGA 

14 GAT TCC TTG AAA ATG AAG TAA TTG TCT GAA TGT CCA ATG GAG GCT 
15 

16 AAC 

17 TTG. 

18 
19 

20 37. A cloned DNA consisting essentially of a DNA having the base pair sequence 

21 defined in claim 36. 

22 

23 38. A cloned DNA consisting essentially of a DNA which codes for a polypeptide 

24 having the amino acid sequence in claim 35. 

25 



26 39. A recombinant plasmid wherein a DNA which codes for the amino acid sequence: 



27 


Met 


Ser 


Tyr 


Asn 


Leu 


Leu 


Gly 


Phe 


Leu 


Gin 


Arg 


Ser 


Ser 


Asn 


Phe 


Gin 


28 


Cys 


Gin 


Lys 


Leu 


Leu 


Trp 


Gin 


Leu 


Asn 


Gly 


Arg 


Leu 


Glu 


Tyr 


Cys 


Leu 


29 


Lys 


Asp 


Arg 


Met 


Asn 


Phe 


Asp 


He 


Pro 


Glu 


Glu 


He 


Lys 


Gin 


Leu 


Gin 


30 


Gin 


Phe 


Gin 


Lys 


Glu 


Asp 


Ala 


Ala 


Leu 


Thr 


He 


Tyr 


Glu 


Met 


Leu 


Gin 


31 


Asn 


He 


Phe 


Ala 


He 


Phe 


Arg 


Gin 


Asp 


Ser 


Ser 


Ser 


Thr 


Gly 


Trp 


Asn 


32 


Glu 


Thr 


He 


Val 


Glu 


Asn 


Leu 


Leu 


Ala 


Asn 


Val 


Tyr 


His 


Gin 


He 


Asn 


33 


His 


Leu 


Lys 


Thr 


Val 


Leu 


Glu 


Glu 


Lys 


Leu 


Glu 


Lys 


Glu 


Asp 


Phe 


Thr 


34 


Arg 


Gly 


Lys 


Leu 


Met 


Ser 


Ser 


Leu 


His 


Leu 


Lys 


Arg 


Tyr 


Tyr 


Gly 


Arg 
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1 

2 
3 



lie Leu His Tyr Leu Lys Ala Lys Glu Tyr Ser His Cys Ala Trp Thr 
lie Val Arg Val Glu lie Leu Arg Asn Phe Tyr Phe lie Asn Arg Leu 
Thr Gly Tyr Leu Arg Asn 



4 



is inserted in a vector DNA. 



5 



6 



40. 



A microorganism containing the recombinant plasmid defined in claim 39. 



7 



8 



41. 



The microorganism according to claim 40 which is Escherichia coli K-12 strain 



9 



294. 



10 



11 



42. 



The recombinant plasmid according to claim 39 wherein said inserted DNA 



12 comprises the following base pair sequence: 

13 ATG AGC TAC AAC TTG CTT GGA TTC CTA CAA AGA AGC AGC AAT TTT 

14 TAC TCG ATG TTG AAC GAA CCT AAG GAT GTT TCT TCG TCG TTA AAA 
15 

16 CAG TGT CAG AAG CTC CTG TGG CAA TTG AAT GGG AGG CTT GAA TAT 

17 GTC ACA GTC TTC GAG GAC ACC GTT AAC TTA CCC TCC GAA CTT ATA 
18 

19 TGC CTC AAG GAC AGG ATG AAC TTT GAC ATC CCT GAG GAG ATT AAG 

20 ACG GAG TTC CTG TCC TAC TTG AAA CTG TAG GGA CTC CTC TAA TTC 
21 

22 CAG CTG CAG CAG TTC CAG AAG GAG GAC GCC GCA TTG ACC ATC TAT 

23 GTC GAC GTC GTC AAG GTC TTC CTC CTG CGG CGT AAC TGG TAG ATA 
24 

25 GAG ATG CTC CAG AAC ATC TTT GCT ATT TTC AGA CAA GAT TCA TCT 

26 CTC TAC GAG GTC TTG TAG AAA CGA TAA AAG TCT GTT CTA AGT AGA 
27 

28 AGC ACT GGC TGG AAT GAG ACT ATT GTT GAG AAC CTC CTG GCT AAT 

29 TCG TGA CCG ACC TTA CTC TGA TAA CAA CTC TTG GAG GAC CGA TTA 
30 

31 GTC TAT CAT CAG ATA AAC CAT CTG AAG ACA GTC CTG GAA GAA AAA 

32 CAG ATA GTA GTC TAT TTG GTA GAC TTC TGT CAG GAC CTT CTT TTT 
33 
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1 CTG GAG AAA GAA GAT TTT ACC AGG GGA AAA CTC ATG AGC AGT CTG 

2 GAC CTC TTT CTT CTA AAA TGG TCC CCT TTT GAG TAC TCG TCA GAC 
3 

4 CAC CTG AAA AGA TAT TAT GGG AGG ATT CTG CAT TAC CTG AAG GCC 

5 GTG GAC TTT TCT ATA ATA CCC TCC TAA GAC GTA ATG GAC TTC CGG 
6 

7 AAG GAG TAC AGT CAC TGT GCC TGG ACC ATA GTC AGA GTG GAA ATC 

8 TTC CTC ATG TCA GTG ACA CGG ACC TGG TAT CAG TCT CAC CTT TAG 
9 

10 CTA AGG AAC TTT TAC TTC ATT AAC AGA CTT ACA GGT TAC CTC CGA 

11 GAT TCC TTG AAA ATG AAG TAA TTG TCT GAA TGT CCA ATG GAG GCT 
12 

13 AAC 

14 TTG. 

15 
16 

17 43. The recombinant plasmid according to claim 39 wherein said vector DNA is an 

1 8 Escherichia coli plasmid. 

19 

20 44. The recombinant plasmid according to claim 43 wherein said Escherichia coli 

21 plasmid is pBR322. 

22 



23 45. A process for preparing a recombinant plasmid which comprises inserting a 

24 synthesized double stranded DNA which codes for the amino acid sequence: 



25 


Met 


Ser 


Tyr 


Asn 


Leu 


Leu 


Gly 


Phe 


Leu 


Gin 


Arg 


Ser 


Ser 


Asn 


Phe 


Gin 


26 


Cys 


Gin 


Lys 


Leu 


Leu 


Trp 


Gin 


Leu 


Asn 


Gly 


Arg 


Leu 


Glu 


Tyr 


Cys 


Leu 


27 


Lys 


Asp 


Arg 


Met 


Asn 


Phe 


Asp 


He 


Pro 


Glu 


Glu 


He 


Lys 


Gin 


Leu 


Gin 


28 


Gin 


Phe 


Gin 


Lys 


Glu 


Asp 


Ala 


Ala 


Leu 


Thr 


He 


Tyr 


Glu 


Met 


Leu 


Gin 


29 


Asn 


He 


Phe 


Ala 


He 


Phe 


Arg 


Gin 


Asp 


Ser 


Ser 


Ser 


Thr 


Gly 


Trp 


Asn 


30 


Glu 


Thr 


He 


Val 


Glu 


Asn 


Leu 


Leu 


Ala 


Asn 


Val 


Tyr 


His 


Gin 


He 


Asn 


31 


His 


Leu 


Lys 


Thr 


Val 


Leu 


Glu 


Glu 


Lys 


Leu 


Glu 


Lys 


Glu 


Asp 


Phe 


Thr 


32 


Arg 


Gly 


Lys 


Leu 


Met 


Ser 


Ser 


Leu 


His 


Leu 


Lys 


Arg 


Tyr 


Tyr 


Gly 


Arg 


33 


He 


Leu 


His 


Tyr 


Leu 


Lys 


Ala 


Lys 


Glu 


Tyr 


Ser 


His 


Cys 


Ala 


Trp 


Thr 
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1 lie Val Arg Val Glu lie Leu Arg Asn Phe Tyr Phe lie Asn Arg Leu 

2 Thr Gly Tyr Leu Arg Asn 

3 in a vector DNA. 
4 

5 46. The process according to claim 45 wherein said vector DNA is an Escherichia 

6 coli plasmid. 

7 

8 47. A process according to claim 46 wherein said Escherichia coli plasmid is 

9 pBR322. 

10 



11 48. A process for producing a microorganism capable of expression of a polypeptide 

12 with interferon activity which comprises transforming a host microorganism with a replicable 

13 recombinant plasmid containing a foreign DNA which codes for the amino acid sequence: 



14 


Met 


Ser 


Tyr 


Asn 


Leu 


Leu 


Gly 


Phe 


Leu 


Gin 


Arg 


Ser 


Ser 


Asn 


Phe 


Gin 


15 


Cys 


Gin 


Lys 


Leu 


Leu 


Trp 


Gin 


Leu 


Asn 


Gly 


Arg 


Leu 


Glu 


Tyr 


Cys 


Leu 


16 


Lys 


Asp 


Arg 


Met 


Asn 


Phe 


Asp 


He 


Pro 


Glu 


Glu 


He 


Lys 


Gin 


Leu 


Gin 


17 


Gin 


Phe 


Gin 


Lys 


Glu 


Asp 


Ala 


Ala 


Leu 


Thr 


He 


Tyr 


Glu 


Met 


Leu 


Gin 


18 


Asn 


He 


Phe 


Ala 


He 


Phe 


Arg 


Gin 


Asp 


Ser 


Ser 


Ser 


Thr 


Gly 


Trp 


Asn 


19 


Glu 


Thr 


He 


Val 


Glu 


Asn 


Leu 


Leu 


Ala 


Asn 


Val 


Tyr 


His 


Gin 


He 


Asn 


20 


His 


Leu 


Lys 


Thr 


Val 


Leu 


Glu 


Glu 


Lys 


Leu 


Glu 


Lys 


Glu 


Asp 


Phe 


Thr 


21 


Arg 


Gly 


Lys 


Leu 


Met 


Ser 


Ser 


Leu 


His 


Leu 


Lys 


Arg 


Tyr 


Tyr 


Gly 


Arg 


22 


He 


Leu 


His 


Tyr 


Leu 


Lys 


Ala 


Lys 


Glu 


Tyr 


Ser 


His 


Cys 


Ala 


Trp 


Thr 


23 


He 


Val 


Arg 


Val 


Glu 


He 


Leu 


Arg 


Asn 


Phe 


Tyr 


Phe 


He 


Asn 


Arg 


Leu 


24 


Thr 


Gly 


Tyr 


Leu 


Arg 


Asn , 























25 

26 49. A process according to claim 48 wherein said host microorganism is Escherichia 

27 coli K-12 strain 294. 

28 
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1 50. A recombinant expression plasmid, comprising a DNA sequence encoding a 

2 mature human fibroblast interferon having a total of 166 amino acids and unaccompanied by a 

3 human fibroblast interferon presequence operably linked therein for expression by an 

4 Escherichia coli host, whereby expression of a mature human fibroblast interferon having a total 

5 of 166 amino acids and unaccompanied by a human fibroblast interferon presequence by said 

6 host is enabled. 

7 

8 51. An Escherichia coli host cell capable of expressing a mature human fibroblast 

9 interferon polypeptide having a total of 166 amino acids and unaccompanied by a human 
10 fibroblast interferon presequence, transformed with an expression plasmid of claim 50. 

11 

12 52. A method of producing a mature human fibroblast interferon polypeptide having a 

13 total of 166 amino acids and unaccompanied by a human fibroblast interferon presequence, 

14 comprising expressing the expression plasmid of claim 50 in Escherichia coli. 

15 

16 Respectfully submitted, 

17 Dated: September 8, 2006 /Thomas E. Friebel # 29,258/ 

18 Thomas E. Friebel 

19 Jones Day 

20 222 East 41 st Street 

2 1 New York, New York 1 00 1 7-6702 

22 Tel.: (212)326-3939 

23 Fax: (212) 755-7306 

24 tefriebel@jonesday.com 

25 
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1 GOEDDEL CLEAN COPY OF CLAIMS 

2 INTERFERENCE No. 105,337 

3 

4 Pursuant to Bd.R. 110(a), Party Goeddel hereby submits a clean copy of the 

5 claims, as issued, in Goeddel's involved Patent No. 5,460,8 1 1 . 
6 

7 1 . A composition comprising water and a nonglycosylated polypeptide having the 

8 amino acid sequence of a mature human fibroblast interferon, said nonglycosylated polypeptide 

9 having a total of 165 or 166 amino acids and said composition being free of any glycosylated 
10 human fibroblast interferon. 

11 

12 2. The composition of claim 1, said nonglycosylated polypeptide having the amino 

13 acid sequence 

14 X-Ser-Tyr-Asn-Leu-Leu-Gly-Phe-Leu-Gln-Arg-Ser-Ser-Asn-Phe-Gln- 

15 Cys-Gln-Lys-Leu-Leu-Trp-Gln-Leu-Asn-Gly-Arg-Leu-Glu-Tyr-Cys-Leu- 

16 Lys-Asp-Arg-Met-Asn-Phe-Asp-Ile-Pro-Glu-Glu-Ile-Lys-Gln-Leu-Gln- 

17 Gln-Phe-Gln-Lys-Glu-Asp-Ala-Ala-Leu-Thr-Ile-Tyr-Glu-Met-Leu-Gln- 

18 Asn-Ile-Phe-Ala-Ile-Phe-Arg-Gln-Asp-Ser-Ser-Ser-Thr-Gly-Trp-Asn- 

19 Glu-Thr-Ile-Val-Glu-Asn-Leu-Leu-Ala-Asn-Val-Tyr-His-Gln-Ile-Asn- 

20 His-Leu-Lys-Thr-Val-Leu-Glu-Glu-Lys-Leu-Glu-Lys-Glu-Asp-Phe-Thr- 

21 Arg-Gly-Lys -Leu-Met -Ser-Ser-Leu-Hi s-Leu-Lys- Arg-Tyr-Tyr-Gly-Arg- 

22 I le -Leu-Hi s-Tyr-Leu-Lys- Ala-Lys-Glu-Tyr- Ser-His-Cys- Ala-Trp-Thr- 

23 Ile-Val-Arg-Val-Glu-Ile-Leu-Arg-Asn-Phe-Tyr-Phe-Ile-Asn-Arg-Leu- 

24 Thr-Gly-Tyr-Leu-Arg-Asn, 
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1 wherein X is H or Met. 
2 

3 3. The composition of claim 2, said nonglycosylated polypeptide having a formula 

4 molecular weight of about 20,027. 
5 

6 4. The composition of claim 1, 2 or 3, said composition being free of human 

7 proteins. 
8 

9 5. The composition of claim 1, 2 or 3, said composition containing a therapeutically 

10 effective amount of said nonglycosylated polypeptide and being suitable for parenteral 

1 1 administration. 
12 

13 6. The composition of claim 4, said composition containing a therapeutically 

14 effective amount of said nonglycosylated polypeptide and being suitable for parenteral 

15 administration. 
16 

17 Respectfully submitted, 

18 Dated: September 8, 2006 /Thomas E. Friebel # 29,258/ 

19 Thomas E. Friebel 

20 Jones Day 

21 222 East 41 st Street 

22 New York, New York 1 00 1 7-6702 

23 Tel.: (212)326-3939 

24 Fax:(212)755-7306 

25 tefriebel@jonesday.com 
26 
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The nucleotide sequence of human fibroblast interferon cDNA 

(Recombinant DNA; restriction analysis ; Maxam-Gilbert technique; amino acid sequence; signal peptide) 
T. Taniguchi, S. Ohno, Y. Fujii-Kuriyama and M. Muramatsu 

Department of Biochemistry , Cancer Institute, Japanese Foundation for Cancer Research, Kami-Ikebukuro , Tokyo 1 70 (Japan) 

(Received March 12th, 1980) 
(Accepted March 20th, 1980) 



SUMMARY 



DNA synthesized by in vitro reverse transcription of the interferon mRNA has been cloned and amplified as 
recombinant DNA, TpIF319-13 {Taniguchi et al., 1979). The nucleotide sequence of this IF cDNA which con- 
sists of 770 bp (excluding the A:T tails) has been determined. The data reported predict the hitherto unknown 
amino acid sequence of human fibroblast interferon and its putative signal peptide. 



INTRODUCTION 

Interferons are species-specific glycoproteins pro- 
duced by almost all vertebrates when induced by 
viruses as well as other inducers (Isaacs and Linden- 
mann, 1957; Stewart, 1979). The best characterized 
biological activity of interferons is the ability to 
elicit an antiviral state in target cells, in conjunction 
with the induction of de novo synthesis of several 
proteins, such as oligoisoadenylate synthetase, pro- 
tein kinase and phosphodiesterase (Lebleu et al., 
1976; Hovanessian et al., 1977; Hovanessian and Kerr, 
1979; Schmidt et al., 1979). Beside their antiviral 
activities, interferons appear to be involved in the 
regulation of immune responses (Johnson, 1977; De 
Maeyer and De Maeyer-Guignard, 1977), and also in 
inhibition of tumor growth (Gresser, 1977). 

At least three different species of interferons, 
which have been classified as Type I(acid-stable inter- 
ferons induced mainly by viruses and double-stranded 
RNAs) and Type II (acid-labile interferon induced 

Abbreviations: bp, base pairs; IF, interferon. 



efficiently by non-viral agents such as mitogens) are 
produced in man (Epstein, 1977; Stewart, 1979). 
Among Type I interferons, fibroblasts produce an 
interferon which is distinct immunologically as well 
as in target cell specificity from that produced by 
leukocytes, suggesting that those two interferons are 
coded for by distinct structural genes (Cavalieri et al., 
1977; Hayes et al., 1979). 

Although biological activities as well as physico- 
chemical properties of these interferons have been 
extensively studied (Knight, 1976; Rubinstein et al., 
1979; Tan et al., 1979), no complete structures have 
so far been established, mainly due to the low 
amount of the proteins produced by the human 
cells. Very little has been learnt about the gene 
structure and gene organization of interferons. 

In order to solve these problems, we have con- 
structed a hybrid plasmid containing a human fibro- 
blast interferon cDNA sequence (Taniguchi et al., 
1979) and presented evidence that the cloned DNA 
in fact codes for human fibroblast interferon (Tani- 
guchi et al., 1980). 

In this report, we present the nucleotide sequence 
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of the cDNA which encompasses the entire coding 
region as well as a part of the untranslated regions 
of human fibroblast interferon mRNA. 



MATERIALS AND METHODS 

(a) Hybrid plasmid TpIF319-13 

Isolation of a bacterial clone containing the hybrid 
plasmid TpIF319-13 has been described in previous 
papers (Taniguchi et al., 1979; 1980). All cloning 
procedures had to be conducted in a P3 faculty at the 
Cancer Institute, Tokyo, as required by the Guidelines 
for Research Involving Recombinant DNA Molecules 
issued in March, 1979 by the Ministry of Education, 
Science and Culture of Japan. (This was a totally 
unwarranted precaution from the scientific point of 
view. - Ed.) 

(b) Preparation of plasmid DNA 

Plasmid DNA was prepared according to the 
method of J. Ecsodi and C. Weissmann (personal 
communication), a modification of the procedure of 
Currier and Nester (1976). 

(c) Restriction analysis 

The location of endonuclease cleavage sites was 
determined from polyacrylamide or agarose gel 
electrophoresis patterns following the published 
procedures' (Smith and Bimstiel, 1976; Tabak and 
FlavelL 1978). 

(d) DNA sequence analysis 

DNA sequence analysis was carried out by the 
procedure of Maxam and Gilbert (1977), with minor 
modifications (C. Weissmann, personal communica- 
tion). . 



RESULTS AND DISGUSSIQN , , 

Hybrid plasmid TpIF3 19-13, consists of a cDNA 
of about 800 base pairs inserted at the £coRI site of 
pBR322. The cDNA has been shown to contain the 
mRNA sequence coding for the N-terminal region of 



the human fibroblast interferon as well as for the 
signal peptide (Taniguchi et al., 1-980). The locations 
of several restriction endonuclease cleavage sites of 
the cDNA were determined; a cleavage map is shown 
in Fig. 1. After digestions of either intact plasmid or 
isolated fragments containing the cDNA insert by one 
or more restriction enzymes, the resulting DNA 
fragments were labeled at their 5' end with tri- 
phosphate, and the nucleotide sequence analysis was 
carried out according to the procedure of Maxam and 
Gilbert (1977). The sequencing strategy is presented 
in Fig. 1 . The whole cDNA sequence, encompassing 
the entire coding region of the interferon as well as 
a part of the non-translated region, was determined 
and is presented in Fig. 2. 

As has been reported in a previous paper (Tanigu- 
chi et al., 1980), the 5' terminal region of the inter- 
feron mRNA, as deduced from the cDNA, contains 
at least 6 nucleotides before the first AUG codon 
in phase. If this AUG functions as an initiator, the 
primary translation product of human fibroblast 
interferon mRNA contains a signal peptide con- 
sisting of 21 amino acids which would eventually be 
cleaved off during secretion. Alternatively, there may 
be another AUG sequence farther upstream in a 
segment of the mRNA which was not copied into the 
TpIF319-13 cDNA. As shown previously, the mRNA 
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Fig. 1 . Restriction map of human fibroblast interferon cDNA 
insert and adjacent regions of TpIF319-13. The location of 
endonuclease cleavage sites were determined from gel electro- 
phoresis patterns following simultaneous and sequential 
digestions of either intact plasmid or isolated fragments 
containing cDNA insert. The rectangle represents the cDNA 
insert, and the shaded area the A-T junctions. The strategy 
of the DNA sequence analysis is presented in the lower part 
of the figure. Arrows indicate the direction and extent of 
sequencing of each fragment analysed; the vertical bar at the 
end of each arrow represents the 5'-terminal, labeled phos- 
phate. 
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£JEI 

GTC AAC ATG 
CA6 TTG TAC 

Am Leu Lem 
AAC TT6 CTT 
TTG AAC GM 
80 

£?B £*s Leu 

TAT TGC CTC 

ATA ACG GAG 
160 

Ala Leu Ihe 

GCA TTG ACC 

CGT AAC TGG 
240 



-20 

Jim Asa L*s Cis Leu Leu Gut 
ACC AAC AAG TGT CTC CTC CAA 
TGG TTG TTC ACA GAG GAG GTT 

Glx Ehe Leu uln Ars See Ser 

GGA TTC CTA CAA AGA AGC AGC 

CCT AAG GAT GTT TCT TCG TCG 
100 



-10 

Ile Ala Uu Leu Leu 

ATT GCT CTC CTG TTG 
TAA CGA GAG GAC AAC 

46 



his. Ase 
AAG GAC 
TTC CTG 

60 

III lie 
ATC TAT 
TAG ATA 



Ihe Ile Val £lu flsji 

ACT ATT GTT GAG AAC 

TGA TAA CAA CTC TTG 
320 

110 

Lis Gm Ase Ehe Ihr Ars 

AAA GAA GAT TTC ACC AGG 

TTT CTT CTA AAG TGG, TCC 
WO 

L*s Ala bcs Glu Iy_r Sir 
AAG GCC AAG GAG TAC AGT 

nc cgg nc ctc atg tca 

160 



Ars Hei ten Ehe Ase 

AGG ATG AAC TTT GAC 

TCC TAC TTG AAA CTG 
1&) 

Glu Hei Leu Gm Ash 

GAG ATG CTC CAG AAC 

CTC TAC GAG GTC TTG 
260 

90 

Uu Leu Ala Ash Ial 

CTC CTG GCT AAT GTC 

GAG GAC CGA TTA CAG 
340 



Asu Ehe 
AAT TTT 
TTA AAA 

_ 40 n 

iLE EfiQ 

ATC CCT 
TAG GGA 

Lle Ehe 
ATC TTT 
TAG AAA 



Glu C_y_s Glu 

CAG TGT CAG 

GTC ACA GTC 
120 

Glu Glu Lle 

GAG GAG ATT 

CTC CTC TAA 
260 

Ala Lle Ehe 

GCT ATT TTC 
CGA TAA AAG 



I*e His Gm Lle Asm 
TAT CAT CAG ATA AAC 
ATA GTA GTC TAT TTG 



1 

Cys Ehe See Ihe Ihe Ala Leu See Hei See Lye 

TGC TTC TCC ACT ACA GCT CTT TCC ATG AGC TAC 

ACG AAG AGG TGA TGT CGA GAA AGG TAC TCG ATG 
66 

l^L^UuMGLJiUuAsjiGj^AafiLEuGLu 
AAG CTC CTG TGG CAA TTG AAT GGG AGG CTT GAA 

TTC GAG GAC ACC GTT AAC TTA CCC TCC GAA CTT 
140 

Lfs Gi^ 1^ Eji GLfj Ehe Glw Us (kij As£ Ala 
AAG CAG CTG CAG CAG TTC CAG AAG GAG GAC GCC 
TTC GTC GAC GTC GTC AAG GTC TTC CTC CTG CGG 

80 

Arg Glu Ase See See See Ihe Gut Ire. Asa Glu 

AGA CAA GAT TCA TCT AGC ACT GGC TGG AAT GAG 

TCT GTT CTA AGT AGA TCG TGA CCG ACC TTA CTC 
280 300 

.00 



His Leu Lis Ihe Val Lelu Glu Glu Lis Leu Qui 

CAT CTG AAG ACA GTC CTG GAA GAA AAA CTG GAG 

GTA GAC TTC TGT CAG GAC CTT CTT TTT GAC CTC 
360 3S0 



Ihe Gut Lea Leu Arq 
CTT ACA GGT TAC CTC CGA 
GAA TGT CCA ATG GAG GCT 
560 



Gls bts leu Mej See Ser leu His Leu Lys Arc Iyr Tyr Gly Ars Lle leu Hls ttR Leu 

GGA AAA CTC ATG AGC AGT CTG CAC CTG AAA AGA TAT TAT GGG AGG ATT CTG CAT TAC CTG 

CCT TTT GAG TAC TCG TCA GAC GTG GAC TTT TCT ATA ATA CCC TCC TAA GAC GTA ATG GAC 

420 440 460 

n^O 150 

His. Cy_s Ala T^ Ihe Lle !^al Arq ¥al Glu Ile Leu Am Ash Ehe Ixr Ehe Lle Ash Ash 

CAC TGT GCC TGG ACC ATA GTC AGA GTG GAA ATC CTA AGG AAC TTT TAC TTC ATT AAC AGA 

GTG ACA CGG ACC TGG TAT CAG TCT CAC CTT TAG GAT TCC TTG AAA ATG AAG TAA TTG TCT 
500 520 540 

.66 * 1 ' 



AAC TGA AGA TCT CCT AGC CTG TGC CTC TGG GAC TGG ACA An GCT TCA AGC ATT CTT CAA 
TTG ACT TCT AGA GGA TCG GAC ACG GAG ACC CTG ACC TGT TAA CGA AGT TCG TAA GAA GTT 
580 660 6$0 
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CCA GCA GAT GCT GTT TAA GTG ACT GAT GGC TAA TGT ACT GCA TAT GAA AGG ACA CTA GAA GAT TTT GAA ATT TTT An 
GGT CGT CTA CGA CAA ATT CAC TGA CTA CCG ATT ACA TGA CGT ATA CTT TCC AGT GAT CTT CTA AAA CTT TAA AAA TAA 
640 660 6S0 7d0 

AAA TTA TGA GTT An Tn ATT TAT TTA AAT TTT An TTG GAA AAT AAA nA TH TTG GTG CAA AAG TCA AAA AAA 
TTT AAT ACT CAA TAA AAA TAA ATA AAT TTA AAA TAA AAC Cn TTA TH AAT AAA AAC CAC GTT nc AGT TTT m 
720 7fo 760 

Fig 2. Nucleotide sequence of the human fibroblast interferon cDNA and the predicted amino acid sequence of the pre-interferon 
molecule. Determination of the DNA sequence was carried out according to the procedure of Maxam and Gilbert (1977) fcf 

1 i'ii ! SeqUCnCe PeimitS prediction of the entiie amin ° acid sequence for human fibroblast interferon (amino acids' 
W(,6) and its putative signal peptide (amino acids -21 to -1) as shown in the line above the DNA sequences 
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sequence deduced from the cDNA predicts the syn- 
thesis of a polypeptide whose N-terminal amino acid 
sequence is identical to that of human fibroblast 
interferon (Knight et al., 1980; Taniguchi et al., 
1980). The sequence coding for mature interferon 
thus begins with a methionine codon which is fol- 
lowed by codons for 165 amino acids and terminates 
by UGA. The complete amino acid sequence of 
human fibroblast interferon predicted from the 
cDNA sequence is also presented in Fig. 2. 

The amino acid composition of the interferon 
reported by Knight et al., (1980) agrees largely with, 
that deduced from the cDN A sequence. The molecular 
weight of the human fibroblast interferon polypep- 
tide was calculated to be 20040; this is to be com- 
pared with the values of 20000 and 26000 for 
glycosylated fibroblast interferon (Knight, 1976; 
Hayes et al., 1979) and 16000 for unglycosylated 
interferon (Havell et al., 1977), estimated from 
electrophoretic mobilities in sodium dodecylsulfate- 
polyacrylamide gels. The protein structure deduced 
from the DNA sequence agrees with 4he physico- 
chemical properties of human fibroblast interferon in 
the following respects. Hydrophobic amino acids such 
as leucine (25), isoieucine (1 1), phenylalanine (9) and 
valine (5) are abundant and this is consistent with the 
well-known hydrophobicity of the interferon (Knight, 
1976; Tan et al., 1979). There are three cystein 
residues (positions 17, 31 and 141) in human fibro- 
blast interferon which is consistent with the notion 
that S-S bonds may play a role in the maintenance of 
the interferon activity (Tan et al., 1979). 

The sequence analysis of the coding region of 
human fibroblast interferon mRNA indicates that 
codon utilisation is non-random, with certain preferred 
codon choices for some of the amino acids (Fig. 3). 
The use of certain codons such as CUG and CUC 
for leucine, and AAG for lysine seems to be favored 
in many eukaryotic mRNAs examined to date (See- 
burg et al., . 1977; Grantham et al., 1980). It is also 
remarkable that codons containing the sequence CG 
occur quite infrequently: no CGX or XCG codon 
occurs in the interferon mRNA except one CGA 
for arginine. In addition, of the, 49 codons ending 
with C, only 2 are followed by a codon beginning 
with G. This observation seems to apply to all 
eukaryotic mRNAs (Russel et al., 1976; Fiers et al., 
1978). It may be. worth npting : that the AUA codon 
for isoieucine is not-.frequently used in other mRNAs 
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UUC 
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0 
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3 
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1 
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CAU 


3 
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0 
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0 




CUA 


2 
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0 
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CGA 


1 




CUG 


10(1) 




CCG 


0 




CAG 


8 




CGG 


0 


!le 


AUU 


5(1) 


Tkr 


ACU 


2(1) 


Asn 


AAU 




Ser 


AGU 


2 




AUC 


4 




ACC 


3(1) 




AAC 


8(1) 




AGC 


5 




AUA 


2 




ACA 


1(1) 


Lys 


AAA 


4 


Arg 


AGA 


5 


Met 


AUG 


4(1) 




ACG 


0 




AAG 


7(1) 




AGG 


5 


Val 


GUU 


1 


Ala 


GCU 


2(2) 


Asp 


GAU 


2 


Gly 


GGiJ 


1 




GUC 


3 




GCC 


3 




GAC 


3 




GGC 


1 




GUA 


0 
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1 


Glu 
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5 




GGA 


2 




GUG 


1 




GCG 


0 




GAG 


8 




. GGG 


2 



Fig. 3. Codon utilisation in human fibroblast interferon 
mRNA. The numbers indicate how many times each codon 
is used in the mRNA segment coding for the mature inter- 
feron polypeptide. The numbers in parentheses indicate the 
frequency of the codons used in the putative signal peptide. 



(Grantham et al., 1980), whereas 2 out of 11 iso- 
leucines are coded for by AUA in the case of human 
fibroblast interferon. 

At the 3'-end of the mRNA there is a long A + T- 
rich untranslated segment comprising the sequence 
AATAAA, which appears to be common to all 
polyadenylated mRNAs of eukaryotes studied up to 
date (Proudfoot and Brownlee, 1976). 

One interesting aspect regarding human inter- 
ferons is that the fibroblast and leukocyte interferons 
exhibit a number of different properties (Gresser et 
al., 1974; Havell etal., 1975 ; Berg et al., 1975;Vflcek 
et al., 1977). Since both cDNA clones are available 
(Taniguchi et al., 1979; Nagata et al., 1980) and have 
been sequenced (see also preceding paper), it became 
possible to compare the structures of the two inter- 
feron polypeptides and to determine that human 
fibroblast and leukocyte interferons are structurally 
related (to be published). 
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SUMMARY 

The coding sequences of the cDNAs of cloned human leuko- 
cyte interferon I and human fibroblast interferon show homolo- 
gies, of ASS at the nucleotide and 291 at the amino acid level. 
We estimate that the two genes were derived from a common 
ancestor about 500 to l'OOO million years ago. 
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The acid-stable human interferons are subdivided into two 
major groups, namely fibroblast interferons (F-IF) and leukocyte 
interferons (Le-IF) ; these are the major components of the inter- 
ferons produced by induced fibroblasts and leukocytes, respective- 
ly. Some cells, such as. the lymphoblastoid Namalva cell line, 
produce a mixture of 901 Le-IF and 101 F-IF (1, 2). The two inter- 
feron types have several features in common: both are glycoproteins 
with molecular weights ranging from 16'000 to 26'000 (5-9) , 
the induction and shut-off of their synthesis appears to be 
under similar control (6), and at least some of'the responses 
elicited in target cells are similar, such as induction of 
an antiviral state, which is accompanied by increased synthesis 
**. several proteins (10-13). Nonetheless, the two kinds of 
interferons differ in many respects. Antibodies directed against 
Le-IF do not neutralize F-IF and vice-versa (14), the target 
cell specificities of the two IFs differ (15), and the sequences 
of the 13 amino terminal amino acids of F-IF and of Le-IF (from 
lymphoblastoid cells) show no homology (16, 22). Although Le-IF 
. and F-IF are encoded by different mRNA species (17), it is not 
known whether these mRNAs are transcribed from distinct genes 
or whether they arise from the same gene via a common precursor 
which is processed or spliced in different modes. 

We have recently cloned .and sequenced one species each 
of Le-IF (Le-IF I) (18,19) and F-IF cDNA (20,21). A second 
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species of Le-IF (Le-IF II) cDNA has recently been identified 
GM. Streuli, S. Nagata aadC. Weissmann, unpublished results). 

In Fig. 1 the nucleotide, sequences of Le-IF I and F-IF 
cDNA were aligned so- That the AUGs closest to their 5' termini 
coincided. From the amino terminal sequence published for F-IF 
(16) and lymphoblastoid Le-IF (22) one can determine that in 
the case of F-IF the 21st codon following the initiation triplet 
and in the case of Le-IF the 23rd codon represents the first 
amino acid of the interferon polypeptide. Presumably the stretch 
in between encodes a signal peptide. Since the putative signal 
peptide of Le-IF comprises 23 and that of F-IF 21 amino acids, 
the IF polypeptides, as aligned in Fig. 1, are shifted by two 
residues relative to their termini. In this alignment, 48 of 
166 positions (29t) of the interferon polypeptides have identical 
amino acids. To plot the degree of homology between the F-IF 
and Le-IF as function of the map distance, the sequence was 
subdivided into segments of 8 amino acids (or 24 nucleotides),- 
each overlapping by 4 amino- acids (or 12 nucleotides) with the 
neighboring segments, and the percent coincidence of amino acids 
(and nucleotides) for each segment was determined (cf . van Ooyen 
et al., ref. 23). As seen in Fig. 2, amino acid sequences show 
three domains of homology. The first one, with the least degree 
of homology, corresponds to the 'putative signal sequence, which 
is rich in hydrophobic residues and has 4 identical amino acid 
positions out of 21; the second domain, between the 28th and 
80th amino acid (counted on the Le-IF sequence), has 21 identical 
residues out of 51 (41i homology) and the third, between 

. i 
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positions 115 and 151 (Le-IF sequence), has 19 out o£ 35 identical 

residues (54\). The longest stretches of contiguous conserved 

amino acids are Gln-Phe-Gln-Lys (pos. 47-50 of Le-IF and 49-52 

of F-IF) and Cys-Ala-Trp (pos. 139-141 and pos. 141-143, respective 

the latter sequence is notable because it comprises Cys and 

Trp, which are preferentially conserved in related proteins 

(24). Table 1 shows that conservation was highest between 

the interferon polypeptides (not considering the signal sequences) 

for Trp, Phe, Arg, Cys and Tyr residues, in agreement with 

the general experience that the amino acids most likely to 

be Conserved between related proteins are Trp > Cys > Tyr > Arg > . 

Phe, His (24). Even where amino acids are conserved, the codons 

show one or more nucleotide changes in half the instances. 

The codons of 3 out of 7 conserved Leu residues are non-related, 

as are 2 of 4 codons pertaining to conserved Ser residues. 

This suggests that there is a strong selective pressure favoTing 

the conservation of several amino acids. It is quite likely 

that at least some of the conserved amino acids are essential 

for a function- common to Le-IF and F-IF, perhaps the induction 

of the virus-resistant state in the taTget cell. These findings 

may provide guidelines for the tailoring of modified (2S) , 

possibly shorter polypeptides possessing certain activities 

of interferon. 

The nucleic acid sequences show an average homology of 
431 in the domain of the signal sequence and of 451 in the 
interferon polypeptide sequence. On a random basis, about 
251 of the nucleotide positions should 
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coincide. Within the interferon coding sequence, the nucleotide 
homologies are more evenly distributed than the amino acid homologies 
However, one may distinguish, albeit to a less pronounced 
degree, the same two blocks of similarity noted for the amino 
acids. The longest region without mismatches extends for 13 
nucleotides (cf. 47th to Slst codon of Le-IF vs. 49th to 53rd 
codon of F-IF). There are, in addition, sequences of 17, 18 
and 20 nucleotides with 3, 3 and 4 mismatches respectively . 
The heteropolymeric 3' -terminal non-coding region of Le-IF 
cDNA has 24 2 nucleotides, and is longer by 39 residues than 
its counterpart in F-IF cDNA. In aligning the two sequences 
fouT gaps were introduced to maximize homology, as described 
by van Ooyen et al^ Thereby , several segments were matched 
with 29 to 4U homology. The introduction of gaps in the alignment 
oay be justified in view of the arguments presented previously, 
that introns and non-coding regions of reduplicated genes 
. diverge as a consequence of block insertions and/or deletions 
in the course of evolution (23,26). 

It is unlikely that the extent of homology between Le 
and F-IF cDNA would allow meaningful crosshybridization between 
the two species. 

On the basis of our findings there is no doubt that Le-IF 
and F-IF /genes are derived from a common ancestral sequence. 
When did the separation of these genes occur? Human a and 
& glob in show 571 amino acid mismatches, and human &-globin 
and myoglobin, as well as a-globin and myoglobin, 761 mismatches, 
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If the rate of divergence of interferons and globins is comparable 
(however, cf. p. 50, ref. 24, for proteins showing both higher 
and lower rates) then the separation of interferon genes occurred 
after that of myoglobin and hemoglobins and before that of 
o- and fS-globins, i.e. between S00 and 1000 million years 
ago (24). The interferon gene* may thus be about as old as the 
vertebrates (27) « 
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TABLE 1 Conservation of amino acids in leukocyte and fibroblast 
* 

interferon. 

Number of changes in codon 
Conserved of conserved amino acids 





F-IF 


Le-IF 


amino acids 


0 
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Leu 


25 


22 


8 


1 
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Cys 


3 


5 
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1 


1 
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12 


• - 6 


.1 


1 
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11 


12 
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Phe 


9 
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4 


2 
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Pro 
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10 
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. 1 


Ala 
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2 
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13 


15 
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Ser 


9 


13 


4 




•> 
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2 




Tyr 
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1 


3 


Val 


S 


6 


1 


.1 




Asp 


5 


11 


1 


1 




Thr 


6 


9 


0 






Gly 


6 


3 


0 






Met 


4 


6 


0 






His 


S 


3 


0 








166 


166 


48 


24 


18 



* 

The data are from Taniguchi et al. (ref. 21) and Maatei 
ct al. (ref. 19). 
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flGURE LEGENDS 

Fig. 1 Comparison of the nucleotide sequences of human leuko- 
cyte interferon I (Le-IF I) and human fibroblast 
interferon cDNA and of the derived amino acid sequences. 
The sequences are from Mantei et al.(19) and Taniguchi 
et al. (21). They were aligned to give maximal homology. 
Identical amino acids are framed, identical nucleotides 
are marked by a dot. SI to S23 indicate the amino 
acids of the putative signal sequence; 1 to 166 the 
amino acids of the interferon polypeptides. 



Fig. 2 Similarity of the nucleotide and amino acid sequences 
of human leukocyte interferon 1 and fibroblast inter- 
feron. 

The sequences shown in Fig. 1 were subdivided in 
segments of 8 amino acids or 24 nucleotides, each 
overlapping by 4 and 12 residues, respectively, with 
the neighboring segments. The percentage of coincident 
residues was plotted as a function of map position. 
Open vertical blocks, nucleotides; filled vertical 
blocks, amino acids. L-IF, leukocyte interferon cDNA; 
F-IF, fibroblast interferon cDNA; lines, non-coding 
sequences; hatched bars, putative signal peptide; 
open bars, interferon polypeptide. 
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Specification 



1. Title of the Invention 

Novel recombinants having a gene which shows com- 
plementarity to the human interferon messenger RNA. 

2. Scope of Claim for Patent 

(1) Novel recombinants that are made by inserting a 
DNA synthesized with human interferon messenger RNA as a 
template into a vector DNA, having a gene which shows com- 
plementarity to the human interferon messenger RNA. 

(2) The novel recombinant in claim 1, wherein the 
recombinant is a recombinant plasmid DNA. 

(3) The novel recombinant in claim 2, wherein the 
recombinant plasmid DNA is a recombinant Escherichia coli 
plasmid DNA. 

3. Detailed Description of the Invention 

This invention relates to novel recombinants havir. 
a gene which shows complementarity to the human interferon 
messenger RNA. 

Interferon is a glycoprotein (molecular weight 
approx. 20,000) with antiviral activity, discovered by 
Isaacs and Lindenmann in 19 57. Subsequent studies have 
revealed its antitumor activity besides its antiviral 
activity and hence a wide clinical application of this 
substance has gradually started. For instance, it is 
reported that this substance is effectively administered to 
various viral diseases, osteosarcoma and mammary carcinoma. 

However, because of its high species-specificity, 
only the interferon derived from human cells can be used 
for human application. At present, the interferon which is 
being used for administration has a relative activity of 
about 10 (International units) per 1 mg, which corresponds 
to the purity of about 0.1 - 0.01%. 
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Further,, the wide use of the interferon is 
impossible because of the difficulty of its mass-production. 
At present even for. the interferon requirement for clinical 
tests UO 13 units per year), the supply is only about 1%. 
For these reasons, development- of technology to produce 
human interferon with a high purity easily and also in 
large quantities is desired. 

The present inventors thought that it was the 
novel technique for- producing interferon with ease and in 
a large quantity to insert a human interferon gene into a 
plasmid DNA (for instance plasmid DNA derived from 
Escherichia coli) with the recombinant DNA (deoxyribonucleic 
acid) technology. The inventors have completed this inven- 
tion based on the thought. 

That is, this invention relates to a novel 
recombinant, having a gene which shows complementarity to 
the human interferon messenger RNA, in which a DNA synthe- 
sized using the human interferon as a template is inserted 
into a vector DNA- The 'aim of this invention is to provide 
novel recombinants which grow and amplify in bacteria such 
as Escherichia coli and, as a result, can be used to produce 
human interferon in bacteria such as Escherichia coli. 

The novel recombinant (or recombinant plasmid 
DNA) having a gene which shows complementarity to the human 
interferon messenger RNA has been obtained for the first 
time by the present inventors. The novel recombinant is a 
very useful substance which may be used for amplification 
in bacteria such as Escherichia coli and production of 
human interferon in large quantities and at low cost. 

The novel recombinant of this invention can be 
obtained by the following procedure. First, cytoplasmic 
RNA is extracted from either human fibroblast, MG6 3 cells 
or others induced by poly (I) (C) which is a double-stranded 
RNA composed of polyinosinic acid and polycytidylic acid 
and is sold by the U.S. CALBIOCHEM Co. etc., or human . 
leucocytes, lymphoblastoid cells, NAMALVA cells- or others 
induced by Sendai virus, or lymphocytes induced by various 



Board Assigned Page #2101 




- 3 - 

mitogens. From this RNA, the human interferon messenger 
RNA (hereinafter messenger RNA is referred to as mKNA) 
containing poly A (polyadenylic acid) is isolated. 
A double- stranded DNA is synthesized by reverse transcriptase, 
etc. with the in RNA portion having high interferon toRMA 
activity as the template. A recombinant is obtained by 
inserting the synthesized DNA into a vector DNA such as 
Escherichia coli plasmid DNA 'by the technique of recoisbinan- 
DNA. 

The process of producing the present recombinant 
is explained in detail below. 

First, human fibroblasts may be obtained from 
fetus-derived foreskin, etc. 

A small amount of interferon is added to the 
culture fluid of human fibroblasts to prime production of 
interferon by human fibroblasts, to which poly (I) (C) is 
added to induce the production of interferon. Cycloheximide 
is added simultaneously to increase the production of inter- 
feron (superinduction', and the level of interferon mENA. 
At an appropriate time (e.g. 4 hours) after human fibro- 
blasts are superinduced in this way, cells are collected 
and destroyed and nuclei are removed. Cytoplasmic total 
RNA is extracted with phenol, etc. RNA can also be 
extracted by destroying the whole cells, extracting both 
DNA and RNA with phenol, etc. and degrading and removing 
DNA with DNAase. 

Further, RNA can also be extracted from MGS3 
cells which can be obtained from patients of osteosarcoma, 
etc. instead of human fibroblasts, by inducing as described 
above. 

Furthermore, RNA can be .induced in human leucocytes, 
lymphoblastoid cells which can be obtained from lymphoma 
patients or NAMALVA cells which can be obtained from lynrohorr.a 
patients by incubation with Sendai virus (which can be 
obtained from medical schools) , etc. and extracted therefrom 
as described above. 
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RNA can also be extracted from lymphocytes which 
are induced by various mitogens by the above described 
techniques. 

The thus extracted RNA is dissolved in a salt 
solution with a high concentration (e.g. solutions of NaCI 
and KC1) , and put on a column of oligo (dT) cellulose sold 
by P-L Biochemicals Co., USA to adsorb mRNA having poly (A) 
on the column. Elution is carried out with water, a salt 
solution with a low concentration such as. 10 mM Tris-HCl 
buffer, or the like to isolate mRNA having poly (A) . 

The isolated mRNA is fractionated according to 
the difference mainly in molecular weight by sucrose density 
gradient centrif ugation. Interferon mRNA activity in each 
fraction is checked by determining interferon activity 
(antiviral activity) of the protein which is synthesized 
in oocytes of African claw toad ( Xenopus leavis) by micro- 
injecting a part of the fraction. .... 

Then, a DNA complementary to the mRNA is synthe- 
sized in vitro by a reverse ' transcriptase which is obtained 
from avian myeloblastosis virus, using an mRNA having the 
highest specific activity fraction as the template. 

The synthesis. is carried out as follows. 

An mRNA is reacted at an appropriate temperature 
(e.g. 37°C) for an appropriate period (e.g. 60 min.) with 
oligo(dT) sold by. P-t Biochemicals Co. ,. USA, MgCl 2 (e.g. 
5 mM) , NaCl (e.g. 30 mM) , mercaptoethanol (e.g. 5 mM) and 
Tris-HCl buffer (e.g. pH 8.0, 40 mM) using a reverse tran- 
scriptase together with deoxyadenosine triphosphate (dATP) , 
deoxythyraidine triphosphate (dTTPJ , deoxyguanosine triphosphate 
(dGTP) and deoxycytidine triphosphate (dCTP) (e.g. 0.5 mM 
each) as substrates. 

The thus obtained reaction product containing 
mainly DNA is subjected to deproteinization with phenol 
etc. and the template RNA is. removed by alkali treatment. 
A double- stranded DNA is synthesized by a reverse transcrip- 
tase in a similar way as the synthesis of the DNA complemen- 
tary to mRNA described above except that mRNA is replaced 
by DNA and oligo (dT) is excluded. 
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Incidentally, by using Escherichia coli DNA 
polymerase I which can be obtained from Escherichia coli 
MRE 600, etc. instead of reverse transcriptase, the same 
double- stranded DNA can be synthesized. 

After the double- stranded DNA which is synthesized 
by the above described procedure is treated with Nuclease 
Sj_ which can be obtained from Aspergillus orvzae in the 
presence of ZnCl 2 (e.g. 1 mM) , etc., sodium acetate buffer 
(e.g. 0.1 M, pH 4.5), NaCl (e.g. 0.2 M) ,. etc., deoxyadenine 
chains are formed at both 3' ends of. the synthesized DNA by 
incubating with a terminal transferase (e.g. sold by Bethesca 
Research Laboratories, USA) purified from calf thymus in the 
presence of potassium cacodylate buffer (e.g. pH 7.6, 0.14 K), 
Tris (base) (e.g. 0.03 M) , dithiothreitol (e.g. 0.1 mM) , 
CoCl 2 (e.g. 1 mM) and &ATP (e.g. 1 mM) at an appropriate 
temperature (e.g. 37°C) for an appropriate period (e.g. 20 
min . ) . 

On the other hand, a plasmid DMA which is used 
as a vector DNA, e.g. Escherichia col:, plasmid pBR322 DNA 
[Gene vol. 2, p. 95-113 (1977)], is cut at one site by 
treating with a restriction endonuclease EcoRl, which can 
be obtained from Escherichia coli RY13, etc. in the presence 
of Tris-HCl buffer (e.g. pH 7.5, 10 mM) , MgCl 2 (e.g. 6 mil)-, 
NaCl (e.g. 0.1 M) , mercaptoethanol (e.g. 6 mM) , etc. and 
then treated with phage X-derived exonuclease , which can 
be obtained from Escherichia coli W3102 ( X cI851 x 13) , etc. , 
in the presence of Na-glycine buffer (e.g. pH 9.5, 0.1 M) , 
MgCl 2 (e.g. 5 mM) , etc. Thereafter deoxythymidine chains 
are formed at both 3 1 ends in the same way as for the above- 
described synthesized double-stranded DNA by using dTTP 
instead of dATP. 

Synthetic double- stranded DNA and plasmid DNA 
which are chain-elongated at both 3' ends as described above 
are incubated at an appropriate temperature for an appro- 
priate period with Tris-HCl buffer (e.g. pH 7.5, 50 mM> , 
NaCl (e.g. 0.1 M) , EDTA (e.g. 5 mM) , etc. and hybridized 
with hydrogen-bonds of adenine and thymine. Then, a trans- 
formable Escherichia coli strain, e'.g. Escherichia coli x 
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1776 [Molecular Cloning' of Recombinant DNA, Scott, W.A. & 
Werner, R. edited. Academic Press p. 99-114 (1977)] is 
transformed with the hybridized DNA by the method of Enea 
et al. (J. Mol. Biol. Vol.96, p. 495-509, 1975) or the like. 

In the novel recombinant plasmid DMA thus obtained, 
there exists a vector DNA gene, e.g. S-lactaraase (enzyme 
that destroys ampiciliin) gene, of Escherichia coli plasmid 
pBR322. Therefore, the transformed Escherichia coli shows 
resistance to ampiciliin. The following technique is used 
to pick up a strain with a novel recombinant having a gene 
which shows complementarity to the human interferon messenger 
RNA among these ampiciliin resistant strains. 

First, [ 32 P] labelled DMA is synthesized with the 
RNA having interferon mRNA activity described above as a 
template and the DNA- is hybridized with mRNA extracted, 
without induction by poly(I) (C) (therefore, this mRNA does 
not contain interferon mRNA) , from the human fibroblast by 
incubating at a high temperature (e.g. 65°C) in a reaction 
mixture containing NaCl (e.g. 0.5 M) , etc. Then, the 
hybridized DNA (Probe A) and non-hybridized DNA (Probe B) 
are separated by hydroxyapatite column chromatography. . 
Next, filter-fixed DNAs of transformants are hybridized 
separately with Probe B or Probe A according to the technique 
of Grunstein-Hcgness [Proc. Nat. "Acad T "Sci. • USA, vol.72, 
p. 3961-3965 (.1975)] and strains having a DNA hybridizable 
with Probe B but not or hardly with Probe A are discerned 
by autoradiography. 

Then, plasmid DNA is isolated from the thus 
discriminated strain and hybridized with mRNA having 
interferon mRNA activity by incubating at a high temperature 
(e.g. 53°C) in the presence of 80% (w/v) formamide, 0.4 M 
NaCl, etc. Since the mRNA hybridized with the plasmid DNA 
from the above-described strain does not pass through 
nitrocellulose filter and is trapped, this mRNA is eluted 
from the filter at a high temperature (e.g. 60°C) with a 
solution such as 9 0% (v/v) formamide and injected into 
oocytes of African claw toad and interferon activity is 
determined as described above. 
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X«Ihen interferon activity is determined to be 
positive with this procedure, the DMA used for hybridization 
is concluded to be a DNA having a base sequence complemen- 
tary to interferon mRNA and by this method/ a recombinant 
plasmid DNA having a gene complementary to the human inter- 
feron mRNA can be discriminated. 

Novel recombinants of this invention are very 
useful because they enable mass production of interferon 
in Escherichia coli or in eukaryotic cells which can be 
grown in a large scale. 

The invention will be explained more in detail in 
the following example. However, this invention will not be 
restricted thereto. 

Example : 

After priming of human fibroblasts by overnight 
incubation with HEM culture medium containing interferon 
{25 O/ml) , they were superinduced by adding 10 ug/ral of 
poly (I) (C) and 5 ug/ml of cycloheximide to the medium. 

After 4 hours, 10 9 superinduced human fibroblasts 
were destroyed by Teflon homogenizer in the presence of 
0.3 % NP-40 and 50 ug/ml heparin in RS3 buffer (10 mM Tris- 
HC1, pH 7.5; 10 mM NaCl; 1.5 mM MgCl 2 ) . Nuclei were removed 
by centrif ugation, and 9.6 mg of cytoplasmic RNA was 
obtained by extracting 3 times with phenol. 

The cytoplasmic RNA was precipitated with ethanol, 
dissolved in 10 ml of 1 mM EDTA solution and incubated at 
65°C for 2 minutes. 2.5 ml of a salt solution with a high 
concentration {0.5 M Tris-HCl, pH 7.5; 1 M NaCl; 50 mi-! 
EDTA) was added to the above solution and the mixture was 
put on an oligo(dT) cellulose column to adsorb mRNA con- 
taining poly (A). Elution was carried out with a salt 
solution with a low concentration (10 mM Tris-HCl, pH 7.5) 
and water to isolate 250 ug of mRNA containing poly (A). 

The. mRNA was precipitated with ethanol and 
dissolved in 0.5 ml of 1 mM EDTA solution. The solution 
was incubated at 65°C for 2 minutes, subjected to centrifu- 
gation through a 5-25% sucrose-density gradient (rotated 
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at 35,000 rpm using the SW40 rotor of Beckmann Co., U.S.A.) 
at 4°C for 16 hrs,, and fractionated into 20 fractions. 

The interferon mRNA activity of each of these 
fractions was determined as mentioned above . The result is 
shown in Table 1. 



Table 1 



Fraction No. 


Interferon Activity 


9 


< 50 units/ml 


10 


44 


u 


550 


i 12 


52 



mRNA in Fraction No. 11 was approximately 5 pg. 
The mRNA and a reverse transcriptase were incubated at 37 °C 
for an hour in 20 pi of a reaction mixture [5 pg mRNA; 1 pg 
oligo(dT); 8 units reverse transcriptase; 5 mM MgClj; 30 mM 
NaCl; 5 mM mercaptoethanol; 40 mM Tris-HCl, pH 8.0] and 
deproteinized with phenol. After RNA was removed by the 
treatment with 0.3 N NaOH at 37°C for 15 hours, the synthe- 
sized single-stranded DMA was incubated at 37°C in 20 pi 
of a reaction mixture [the same mixture as described above 
except that mRNA and oligo(dT) were excluded] for an hour 
to synthesize 1.5 pg of a doub'le-stranded DMA. 

The double-stranded DNA was treated with Nuclease 
S]_ in 50 pi of a reaction mixture (1.5 pg double-stranded 
DNA; 1 mM ZnCl 2 ; 0.1 M sodium acetate, pH 4.5; 0.2 M NaCl; 
0.05 unit S^) at 37 °C for 30 minutes, and the enzyme was 
removed with phenol . The DNA was precipitated with ethanol 
and then treated with a terminal transferase in 20 pi of a 
reaction mixture (1.5 pg DNA; 0.14 M potassium cacodylate , 
pH 7.6; 0.03 M Tris (base); 0.1 mM dithiothreitol ; 1 mM 
CoCl 2 ; 1 mM dATP; 1 unit terminal transferase] at 37°C for 
20 minutes. Thus about 100 deoxyadenosine chains were formed 
at both 3' ends of the double- stranded DNA. 

On the other hand, 10 pg of Escherichia coli 
plasraid pBR322 DNA was treated at 37°C for 2 hours with a 
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restriction endonuclease EcoRI in 100 yl of a reaction 
mixture (10 mM Tris-HCl, pH 7.5; 6 mM MgCl 2 ; 0.1 M NaCl; 
6 mM mercaptoethanol ; 10 units EcoRI) leading to the 
cleavage at the only one cutting site in pBR322 DXA. 
The cut plasmid DNA was treated with an exonuclease derived 
from phage \ in 200 yl of a reaction mixture (10 yg DMA; 
0.1 M Na-glycine, pH 9.5; 5 mM MgCl 2 ; 50 ug/ml albumin; 
17.5 units X exonuclease) at 0°C for 90 minutes, and the 
enzyme was removed with phenol. The DNA was treated with 
a terminal transferase in 50 yl of a reaction mixture (10 
yg DNA; 0.14 M potassium" cacodylate , pH 7.6; 0.03 M Tris 
(base); 0.1 mM dithiothreitol ; 1 mM CoCl 2 ; 1 mM dTTP; 
2 units terminal transferase) at 37 °C for 20 minutes. 
This led to the elongation of about 100 deoxythymidylate 
residues on both 3' ends of plasmid pBR322 DNA described 
above. 

0.02 yg of the synthesized double- stranded DMA 
thus obtained, and 0.1 yg of the plasmid pBR322 DNA were 
incubated for hybridization in a solution containing 0.1 M 
NaCl, 50 imM Tris-HCl (pH 7.5) and 5 mM EDTA at 65°C fcr 
2 minutes, at 45°C for one hour, at 37°C for one hour and 
at room temperature for one hour. Then, Escherichia coli x 
1776 was transformed with the hybridized recombinant 
following the method of Enea et al. 

About 4,000 ampicillin-resistant strains were 
isolated in this way. 3,600 resistant strains were chosen, 
and each DNA was fixed on two nitrocellulose filters 
(Grunstein-Bogness Method) . 

On the other hand, [ 32 P] labelled single-stranded 
DNA was synthesized (about 0.44 yg, specific radioactivity 

a 

approx. 6 x 10 c.p.m./yg) by a reverse transcriptase xn 

the same way as that for single- stranded DNA mentioned above 
; '32 

(dCTP was labelled with P) using the interferon mRKA 

1 fraction (about 10 yg) which had been extracted and 

partially purified as described above, as a template. 

The DNA was hvbridized in 50 yl of a reaction mixture (25 yg 

32 

mRNA; 0.45 yg single- stranded DNA labelled with P; 0.5 M 
NaCl; 25 mM Pipes buffer, pH 6.5) at 65 °C for 40 hours with 
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25 Wg of mRNA extracted from human fibroblasts which had 
not been induced by poly (I) (C) by the same method as to 
extract poly (I) (C) -induced mRNA . The reaction mixture was 
put on a hydroxyapatite column, and elution is first carried 
out with 0.14 M phosphate buffer (pH 6.5) to eluate the 
single- stranded DNA, and then with 0.4 M phosphate buffer 
to eluate the DMA hybridized with RNA. . As the result, the 
DNA (about 90 % of the whole) (Probe A) which hybridized 
with the mRNA mentioned above, and the DMA (about 10 % of .' 
the whole) (Probe B) which did not hybridize with it were 
isolated. 

Each Probe was hybridized with the above DNA fixed 
on the nitrocellulose filter according to the Grunstein- 
Hogness method. By means of autoradiography four strains 
were isolated which react mainly to Probe B but little to 
Probe A. 

Table 2 shows the extent of reaction of the DMAs 
from the four strains to each Probe as revealed by auto- 
radiogram. 



Table 2 



Ampicillin- 

resistant 

strains 


Extent of Reaction of Probe 
with DMA in the strains 


Probe A 


Probe B 


# 319 


+ + 


+ + + + 


# 644 


+ 


-f + + 


# 746 




+ + 


#3578 


+ 


-r -f- -f -i- -h 



. Plasmid DNA was isolated from cells of the four 
strains by the method of Currier and Nester [Analyt. Biochem. 
vol. 76, p. 431-441 (1976)]. Then, these DNAs were hybridized 
with the interferon mRNA as follows ; 

First, 5 ug of plasmid DNA was incubated with 
restriction endonuclease Hind III which can be obtained 
from Haemophilus influenzae Rd in 50 pi of a reaction mixture 
£10 mM Tris-HCl, pH 7.5; 6 mM MgCl 2 ; 50 mM NaCl; 6 mM 
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mercaptoethanol; 5 units Hind III) at 37°C for 2 hours and 
thus cut with the restriction endonuclease. After depro- 
teinization with phenol, the cut DNA was precipitated with 
ethanol and dissolved in 20 pi of 80 % (w/v) formamice. 
The solution was heat-denatured at 85°C for 10 minutes and 
was incubated in the solution' consisting of 2.5 ug raRSA , 
20 pi 80 % (w/v) formamide, 20 mM Pipes buffer (pH 6.5), 
0.4 M NaCl and 5 mM EDTA, at 53°C. Four hours later the 
mixture was mixed with 0.4 ml of 3 x SSC (1 x SSC correspor.: 
to 0.15 M NaCl, 0.015 M sodium citrate) at 0°C, and was 
filtered through a nitrocellulose filter (diameter : 1 cm, 
pore size : 0.45 um) at a rate of about 0.5 mi per minute. 
After washing the filter with about 1.5 ml of 2 x SSC, the 
filter was immersed in a solution consisting of 0.6 ml of 
90 % (v/v) formamide, 20 mM Pipes buffer, 0.1 % SDS (sodium 
dodecyl sulfate) and 5 mM EDTA. The incubation of the 
filter at 60 °C for 2 minutes and the removal of the solutic. 
were repeated 3 times and the RNA eluted from the "nitro- 
cellulose filter into the solution (1.8 ml) was precipitate- 
with ethanol. mRNA containing poly (A) was isolated from 
the RNA by using oligo (dT) cellulose column chromatography, 
dissolved in a mixture of 3 pi of 10 mM Tris-HCl (pH 7.5) 
and 88 mM NaCl and injected into the' oocytes of African 
claw toad. The interferon activity (antiviral activity) 
in the synthesized protein was determined. 

Table 3 shows the interferon mRNA activity of the 
mRNA which has hybridized with the DNA derived from the 
four bacterial strains mentioned above. 



Table 3 



Bacterial 
strain 


Interferon mRNA 
activity (unit/ml) 


8 319 


360 


5 644 


< 10 


# 746 


. 15 


S3578 


< 10 


pBR322DNA 


< 10 
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Further experiment was carried out using strain 

#319 DMA. 

5 ug of plasmid DNA obtained from strain #319 by 
the Currier and Nester method was cut with restriction 
endonuclease Hind III in the same way as mentioned above. 
The DMA and the recombinant plasmid 8GpBR322 DNA (the 
vector was pBR322) (obtained from the Institute for 
Molecular Biology I of University of Zurich) containing 
rabbit 0-globin gene, separately or as a mixture, were 
hybridized with the mixture of rabbit globin mRNA (obtained 
from rabbit red blood cells) (1 yg) and interferon mRNA 
(2.5 vg) obtained from human fibroblasts under the same 
conditions as mentioned above. The mRNA which formed 
hybrid was injected into the oocytes of an African claw 

toad. The oocytes were incubated for 15 hours in Barth 1 s 

3 3 
culture medium containing [ H] labelled histidine and [ H] 

labelled globin was isolated by acrylaraide gel electro- 
phoresis and determined quantitatively by f luorography . 
The interferon was determined by antiviral activity as 
described above. The synthesis of rabbit 3-globin and the 
human interferon was determined in this way. • The result 
is shown in Table 4 . 



Table 4 



D N A 


Synthesized 
interferon activity 


Amount of globin 
synthesized 


# 319 

BGpBR322 

mixture of 
both plasmids 


200 (units/ml) 
35 

160 


- 

+ + + + 

i 

+ + + j 



From the result of this experiment it has been 
proved that DKA of #319 has DNA (the interferon gene) which 
forms a hybrid specifically with the interferon mRNA. 

Applicant: Juridical Foundation, Japanese 
Foundation for Cancer Research 

Attorney: Junichi Sakata, Patent Lawyer 
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Dr. Tadatsugu Taniguchi 
Department of Biochemistry 
Cancer Institute 

Japanese Foundation for Cancer Research 
Tokyo, Japan 

Dear Dr. Taniguchi: 

I enjoyed very much our phone conversation. My laboratory at Harvard 
University would be delighted to collaborate' with you to achieve expression 
of your fibroblast interferon gene in E. coli . As you know, we have 
been working for some years developing methods that enable us to express 
efficiently eukaryotic (and prokaryotic) genes in E. coli . We have 
recently developed new procedures that are perfectly adapted to the problem 
of expressing the fibroblast interferon gene. I outline those procedures oh 
enclosed separate pages. We believe that within a month or two we could 
construct plasmids that would direct synthesis of relatively high levels 
(approximately 10,000 molecules/cell) of two forms of fibroblast interferon. 
One of these would match exactly (in primary sequence) the molecule secreted 
by fibroblast cells, and the other would include the leader peptide. Tt 
would be interesting to compare the biological activities of these two 
molecules. 

I imagine we might proceed in either of two ways. First, you might send us the 
clone. We would apply our methods to express it and immediately send to 
you those clones making the two forms of interferon mentioned above. Vie 
now have someone in the lab who can assay interferon activity and so we 
could see the matter through to this stage. Alternatively, you would be 
welcome to spend part or even all of the time required to do the work in 
my lab. (We are overcrowded , but we would manage somehow.) You might 
find it most interesting to come after we have done some of the necessary 
preliminary work with the clone. The exact procedure we use and hence the 
time involved, will depend upon the position of various restriction sites, 
etc. Still another "possibility would be for you to come for the . in it I 
phase of the work. 

It is important to note that I am writing to you solely as a member of 
Harvard University. I am not connected with any outside corporation 
■ (such as Biogon or Genent.e.ch) and we would work as scientific collaborators 
with -no relation to members of those corpora tions . If it is necessary to 
enable you to visit us, I can pay for a part or all of your expenses. 
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Dr. Tadatsugu Taniguchi Page 2 

February 1, 19S0 

I hope you share my feelings of enthusiastic urgency about this matter because, 
as you probably know, various groups in America (Genentech, Dupont, etc.) 
and elsewhere are working very hard to isolate and express- the fibroblast 
interferon gene. Incidentally we have several people in the lab who 
are experts at DNA sequencing, and we could help you complete the sequence 
if that would expedite matters. 

I hope to hear from you so that we can begin this most exciting collaboration 
very soon. Please feel free to telephone me collect. My numbers are: 

617 495-2336 
617 495-2329 
617 868-8006 

Yours truly, 
Mark Ptashne 

Professor of Biochemistry and 

Molecular Biology, 
Harvard University 
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Expressing Eukaryotic Genes in E. Coli 

The background to our current methods is found in the three enclosed papers 
(Backman and Ptashne, 1978 Cell 13,65; Roberts et al . ' 79a PNAS 76, 760; 
Roberts et al.'79b PNAS 76, 5596). The essential requirements are that the 
% eukaryotic gene is transcribed efficiently and that translation of the 

: MRNA begins at a specified AUG encoded by the eukaryotic gene. This results in 

production of a native (unfused) protein with methionine at its amino- 
terminus. In the case of fibroblast interferon, the first residue of the 
exported protein is methionine, and our method can be applied to produce 
either the entire protein including the leader peptide or only the smaller 
version that is secreted by the fibroblast cells. 

The principle of our method is to create a DNA molecule encoding a "hybrid 
^ ribosome binding site", consisting of the ATG from the , eukaryotic gene and 

the "Shine-Delgarno" sequence of the lacZ gene. When these two sequences are 
correctly joined the mRNA is translated efficiently beginning at the desig- 
nated AUG. The 'first use of a hybrid ribosome binding site is .described in 
Backman and Ptashne (1978). In Roberts et al. '79a we show that we can 
use a "portable promoter" to position a promoter (and a Shine-Delgarno 
sequence) at varying distances in front of a gene. In Roberts et al.'79b 
we show how this technique can be applied to express a eukaryotic gene (t) . 
Our more recent methods greatly simplify and increase the efficiency of 
3 the procedures. These methods are unpublished and are outlined in the following 

paragraphs. 

We have constructed, three plasmids (series pLG) that bear the DNA encoding 
the amino-terminal portion of the laci gene fused to DNA encoding the 
carboxyl- terminal portion of lacZ . The carboxyl-terminal portion of 
(I -galactosidase encoded by this DNA is enzymatically active. These 
plasmids have been modified by removing the 5' end of the laci gene and 
by inserting in the remaining portion of laci specific DNA linker frag- 
^ ments bearing restriction enzyme sites. When cut with the appropriate 

j enzyme, each of the three plasmids exposes the i-2 fusion in a different 

; translational reading frame. To express a eukaryotic gene (e.g., fibroblast 

; IF) , we proceed as follows. We first isolate a DNA fragment that encodes 

=' the amino-terminal portion of the IF gene. We then insert this DNA fragment 

! into the appropriate pI,G plasmid so that a fusion protein is encoded that 

; .would read: 

I NH - IF - laci - lacZ - COOH 

Little or no /2-gal would be synthesized , probably, because even if 
transcribed, the mRNA would not bear an efficient E. coli ribosome 
binding site. Next, we une the methods of Roberts et al. '79a and '79b 
to position the portable promoter, and its Shine-Delgarno sequence, at vary- 
ing distances from the ATG of the IF gene. We transform colls and look for 
those colonies producing la r ge amounts of fi. -galactosidase; these have tho 
' promoter (and Shine-Del g.i rno sequence) at optimal distances from Lin' IF 

ATG, and arc therefore producing relatively largo amounts of. the ennyrr-.n t i ca 1 1 y- 
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the methionine T k° " PtOCedure to use the ATG encoding either 

the secrlted Lot. b <;^ng of the leader or that at the beginning of 
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after SDS-polyacrylaraide gel elee- 
tiophorcsis, the eluate should be centri- 
r ged at 20,000 rev/min (Sorvall SS-34 
tor) for 20 minutes to remove particu- 
te matter before dialysis. Coomassie 
lue staining of the gels to locate protein 
lands does not interfere with subsequent 

^sequenator analysis. 

';' New technologies such as the im- 

: proved amino acid sequencing method 
described above lead to new research 
opportunities. With the greater sensitivi- 

..■'ty provided by this technique, we now 
can obtain amino acid sequence informa- 
tion on both proteins and peptides with 
submicrograni (picomole) quantities. 
This sensitivity should permit analysis of 
biomedically relevant molecules— such 
as the interferons— that can only be ob- 
tained in microgram quantities, and this 
ability opens possibilities for further 
study of these molecules. For example, 
knowledge of the amino acid sequence 
permits the synthesis of corresponding 
DNA probes and opens the possibility of 
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new strategies for isolating genes, such 
as those for interferons, that express low 
levels of messenger RNA's (8). 

Michael W. Hunkapiu.f.r 
Leroy E. Hood 
Division of Biology, California Institute 
of Technology , Pasadena 91125 
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Human Fibroblast Interferon: Amino Acid Analysis and 
Amino Terminal Amino Acid Sequence 

Abstract. The purification of human fibroblast interferon has been simplified to a 
two-step procedure consisting of affinity chromatography on Blue Sepharose and 
sodium dodecyl sulfate pofyacryiamide gel electrophoresis, A preliminary amino 
acid composition and the sequence of the 13 ammo-terminal residues of homoge- 
neous interferon prepared by this method is reported. 



Since the discovery of interferon, its 
purification and chemical character- 
ization have been primary goals of inter- 
feron research. Although their attain- 
ment has been slow because of the small 
quantities of interferon proteins avail- 




Fraction number 

Fig. 1. (a) Fractionation of crude interferon on 
a large column of B!ue Sepharose. Blution of 
interferon with 50 percent ethylene glycol in 
column buffer .begins at fraction i. (b) Small 
Blue Sepharose column. Fractions 7 to 1? in 
(a) were pooled, passed through the small col- 
umn, and elated with 50 percent ethylene gly- 
col in column buffer (fractions 1 to 2(1). 

SCIENCE, VOL. 207, I FEBRUARY 1980 



able, purification to homogeneity has 
now been achieved with some inter- 
ferons. However, only microgram quan- 
tities have been available for character- 
ization — human fibroblast interferon. {/, 
2), human lymphoblastoid interferon (3), 
human leukocyte interferon {4), mouse 
interferon (J^and only limited structur- 
al information has been acquired (4, 6). 

A thorough understanding at the mo- 
lecular level of the numerous phenomena 
that are caused by interferon in cells in 
culture and in animals will not be pos- 
sible untO the elucidation of primary and 
secondary structures of the interferon 
proteins is achieved. This structural in- 
formation will permit (i) comparison of 
amino acid sequences of interferons 
from various cell types and animal spe- 
cies, (ti) identification of the polypeptide 
segments involved in binding to inter- 
feron-specific cell-surface receptors, and 
(iii) chemical synthesis of interferons. , 

We now report an improved procedure 
for the purification of human fibroblast 
interferon that can be used to provide 
enough protein for structural studies. 



Using the automated protein micro- 
sequencing technique described in (7), 
we have determined the sequence of 
the 13 amino acid residues at the amino 
terminus of the interferon prepared by 
this method. We also report a prelimi- 
nary amino acid composition of the pro- 
tein. 

Human diploid fibroblast cells (FS-4) 
were cultured and interferon was pro- 
duced (/}. Interferon was assayed by a 
microtechnique (S) with vesicular stoma- 
titis virus as the challenge virus. Inter- 
feron units are given in National Insti- 
tutes of Health human fibroblast inter- 
feron units. 

The crude interferon, 10 to 15 liters 
produced in the absence of serum, was 
made \M in NaCl and passed at room 
temperature through a column (4 by 10 
cm) of Blue Sepharose (Pharmacia, Inc.) 
equilibrated with 6.02M sodium phos- 
phate buffer, pH 7,2, containing 1M 
NaCl. The interferon was retained 
whereas more than 95 percent of the to- 
tal protein passed through the column. 
The interferon was eluted with a mixture 
of the column buffer and ethylene glycol 
(1 : 1), and each fraction was diluted im- 
mediately with 0.5 volume of the buffer 
(Fig. la). Fractions containing interferon 
activity were pooled, diluted with two 
volumes of the column buffer, and 
passed through a. small (1 by 6 cm) Blue. 
Sepharose column for concentration. 
The interferon was eluted as described 
above (Fig. lb). 

Fractions containing interferon were 
pooled, dialyzed against I mM tris-HCl, 



Table 1 . Amino acid composition of human 
fibroblast interferon. 



Amino 



Composition 



acid 


Mole 


Residues per 




percent 


20,000 dalldns 


Asp 


li.i 


18.9 


Thr 


4.0 


6.8 


Ser 


6.2 


10.5 


Olu 


15.9 


27,0 


Pro 


1.6 


2.7 


Gly* 


4.6 


7.8 


Ala 


5.9 


lo.o 


Cyst 


1.0 


1.7 


Val 


3.5 


6.0 


Met 


1.7 


2.9 


He- 


5.3 


9.0 


Leu 


12.0 


20.4 


Tyr 


4.4 


'7.5 


Phe 


5.5 


9.4 


His 


2.9 


4.9 


Lys 


6.8 


51,6 


Arg 


6.4 


10.9 


Trp? 


0.6 


i,0 



"Includes correction for free glycine present in 
unhydrotyzed protein. tDctcrmined after per- 
formic acid oxidation, ^Determined after hydrol- 
ysis. with merc-aptoctbanesulfonic acid. 
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Rg. 2. Preparative electrophoresis of inter- 
feron, activity profile. Fractions 6 to 15 in pig. 
lb were pooled, concentrated, and subjected 
to electrophoresis in.. a- poiyacrylamide slab 
gel, 0.75 mm thick.- Fractions 14 and 15 were 
pooled and processed for amino acid sequenc- 
ing, (b) Poiyacrylamide slab gel, staining of 
proteins eluted from preparative gei <n (a). 
Approximately' 2 percent of the protein in 
fractions 14 and IS (a) was subjected to elec- 
trophoresis and stained. Lanes 1 and 3, stan- 
dard proteins; lane 2, interferon. 



■ Ovalbumin 



Chymotrypsinogen 



' V- 



lysozyme 



pH 6.8, containing 0,02 percent sodium 
dodecyl sulfate (SDS, Bio-Rad elec- 
trophoresis grade), and concentrated to 
dryness in a -vacuum centrifuge. The in- 
terferon was then subjected to elec- 
trophoresis on a SDS-poIyaeryiamide 
slab gel and eluted (Fig. 2a). Fractions 
eluted from the gel were assayed for in- 
terferon- activity (Fig. 2a); Approximate- 
ly 0.2 fig of interferon from the peak ac- 
tivity fraction was subjected to elec- . 
trophoresis in this system again, and the 
gei .wns stained with Coomassie blue 
(Fig- 2b). . 

The preparative electrophoresis trac- 
tions containing interferon were pooled 
and centrifuged for 30 minutes at 30,000 
rev/min at 4°C to remove poiyacrylamide 
gel particles. The interferon solution was 
dialyzed first against 0.15M NaCl con- 
taining 0.1 percent SDS and then against 
0.02 percent SDS. The dialyzed inter- 
feron was concentrated to dryness in a 
vacuum centrifuge. 

This purification procedure is simpler 
and shorter than that described pre- 
viously (/)- Recoveries from the large 
Blue Sepharose column have ranged 
from 50 to 100 percent, and those from 
the small column approach 100 percent. 
The interferon (5 x 10 7 U/mg) eluted 
from these columns is stable for at least 4 
weeks at 4°C in I M NaCl, 35 percent eth- 
ylene glycol, pVL 7.2. Recoveries of ac- 
tivity from the SDS gels have ranged 
from 5 to 20 percent, and specific activi- 
ties of this protein have ranged from 2 x 
I0 9 to 8 x if) 8 . U/mg. Accurate specific 
activities are difficult to- determine, and 



1 4 
H-Me t-Ser -Tyr -Asn -Leu -Leu - 
7 10 :;- . 13 . 

Gly-Phe-Leu-Gln-Arg-Ser-Ser - 

Fig. 3. The ammo-terminal amino acid sequence 
of human fibroblast interferon. 



two- to fourfold differences above 1 x 
10 s U/mg are probably not meaningful. 
Overall yields of purified interferon from 
10- to 15-liter batches of crude material 
(5 x 10 7 . to 7 x 10* total units, 8. x 10 1 
U/mg) have averaged around 10 percent. 
This gives 5 to ) 0 /xg of homogeneous in- 
terferon. 

Amino acid analysis on 1- to 2-^g por- 
tions was performed on a Durrum &-500 
amino acid analyzer (Table 1). Auto- 
mated Edman degradation on 0.4- to 2- 
m portions of the purified interferon was 
performed on a spinning cup sequenator 
(7). Phenylthiohydantoin {Pth) amino 
acids' were identified by high-perform- 
ance liquid chromatography (HPLC) on 
a Du Pont Zorbax CN column (9). 

The sequence of the 13 amino terminal 
amino acid residues of human fibroblast 
interferon was determined by this micro- 
sequencing technique (Fig, 3). Yields of 
Pth methionine at cycle 1 for three se- 
quenator runs ranged from 60 to 100 per- 
cent (based on protein determination by 
amino acid analysis), and the sequenator 
repetitive cycle yields were 92 to 95 per- 
cent. Any unblocked minor peptide se- 
quence present at > 5 percent of the re- 
ported sequence could have been detect- 
ed by the methods used, -but none has 
Board Assigned Page #21 18 



been observed. This result coupled with 
the high initial Pth yields confirmed the 
homogeneity of the interferon poly- 
peptide preparation. 

Determining the amino acid sequence 
of a protein is essential in order to identi- 
fy its active site and to understand the 
molecular mechanism of action. Com- 
parison of structural features of inter- 
ferons from different species and from 
different cell types within an animal will 
prove or disprove whether they are dif- 
ferent proteins. If there is an active site 
common to all interferons, it should be 
identifiable by comparison of the amino 
acid sequences. Comparison of the ammo 
terminal sequence reported here for hu 
man fibroblast interferon does not as yet 
reveal any apparent homology with the 
ammo-terminal sequence reported for hu 
man lymphoblastoid - interferon Q0). 
However, there is limited homology (3 of 
13 residues identical) with the 37,000- 
dalton mouse Ehrlich ascites cell mtei 
feron (//)• 

11. Knight, Jr. 
Central Research & Development 
Department, E. I. du Pont 
de Nemours & Cornpany, . 
Wilmington, Delaware 19898 

M.:W. HUNKAi'lLLER 

Division of Biology, California Institute 
of Technology, Pasadena 91 109 

B. D. Korant, R. W. F. Hakdy 
Central Research & Development 
Department, E. /. du Pont 
de Nemours & Com party 

L- E. Hood 

Division of Biology , 
California institute of Technology 
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Nonglycosylated Mouse L Cell 
Interferon Produced by the Action 
of Tunicamycin* 

{Received for publication, September 18, 1978) 

Jun-ichi Fujisawa, Yoichiro Iwakura, and Yoahimi 
Kawade 

From the Institute for Virus Research, Kyoto University. 
Sakyo-ku, Kyoto 606, Japan 

SUMMARY 

The effect of tunicamycin, a specific inhibitor of pro- 
tein glycosylation, on interferon production by L cells 
induced by Newcastle disease virus was studied. The 
interferon yields in the presence of tunicamycin ranged 
from 20 to 100% of control in various experiments. The 
two molecular species of normal L cell interferon with 
40,000 and 24,000 daltons were completely eliminated, 
and a new species of a smaller size (18,000 daltons) was 
produced. Radiolabeling of interferon with [^methi- 
onine and [ 3 H]glucosamine added to the induced cul- 
tures, as studied by immunoprecipitation and poly- 
acrylamide gel electrophoresis, indicated that the two 
species of control interferon were both glycoproteins 
but tunicamycin interferon contained little or no sugar. 
In support of this was the loss of binding to lectin from 
Wistaria floribunda, A change in physical property of 
the molecule due to the absence of sugar was found in 
decreased heat stability in the presence of sodium do- 
decyl sulfate, although in its absence no significant 
difference from control interferon was observed. These 
results indicate that tunicamycin is an effective agent 
in producing nonglycosylated but fully active inter- 
feron molecules; the sugar moiety appears to be unnec- 
essary for the antiviral activity and for secretion from 
cells. 

Increasing evidence indicates that interferons are glycopro- 
teins (1-5). As with many other glycoproteins, the functional 
role of the carbohydrate remains largely obscure. Dispensa- 
bility of at least part of the carbohydrate moiety for the 
antiviral activity was indicated by glycosidase treatment of 
human interferons (3). Molecular heterogeneity of some in- 
terferons appears to be due to variations in their carbohydrate 
moieties. Removal of the carbohydrate by treatment of inter- 
feron with glycosidases or periodate, or by inhibition of gly- 
cosylation during its biosynthesis, resulted in reduced charge 
heterogeneity, as well as in decreased molecular weights (3-5). 
But these methods may not completely remove the carbohy- 
drate, or they may entail inactivation and some side effects 
on the molecule. 

To clarify the role of the carbohydrate moiety of interferon, 
we have examined the effect of tunicamycin on production of 
mouse L cell interferon induced by Newcastle disease virus. 
This antibiotic specifically inhibits formation of the N-acetyl- 

* This work was aided in part by a grant from the Japanese 
Ministry of Education. The costs of publication of this article were 
defrayed in part by the payment of page charges. This article must 
therefore be hereby marked "advertisement" in accordance with 18 
U.S.C. Section 1734 solely to indicate this fact. 
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glucosamine-lipid intermediate involved in W-glycosidic link- 
ing of core oligosaccharides to protein (6-8), and has been 
successfully used to inhibit glycosylation of various secretory 
and viral membrane proteins (9-14). A previous study in this 
laboratory indicated the glycoprotein nature of L cell inter- 
feron by its binding to some plant lectins (15). In this com- 
munication, we report that inhibition of glycosylation by 
tunicamycin eliminates interferon molecules of normal size 
and forces molecules of a smaller size containing little or no 
sugar but with full antiviral activity to appear. 

MATERIALS AND METHODS 

Interferon Induction— Monolayers of L cells were grown in Eagle's 
minimum essential medium supplemented with 5% calf serum on 30- 
mm plastic tissue culture dishes (Falcon), and infected with Newcastle 
disease virus at a multiplicity of 30 to 40 plaque-forming units per cell 
(16, 17). After an adsorption period of 1 h at room temperature, the 
inoculum was removed, and 0.6 ml of Earle's balanced salt solution 
containing L-glutamine (100 jig/ml) and kanamycin (200 fig/ml) was 
added to each dish. After 20 h at 37°C, the culture fluid was collected 
and clarified by centrifugation. To inactivate the virus, it was incu- 
bated with 0.5% Triton X-100 at 37°C for 10 min. 

Tunicamycin, generously supplied by Professor G. Tamura of the 
University of Tokyo, was added after the virus adsorption process, 
and was present throughout the incubation period. 

For double labeling of interferon, the culture fluid was replaced, at 
7 h after infection (by which time little interferon was produced), by 
0.6 ml of fresh medium with the glucose content reduced to 0.2 g/liter 
containing 50 fiCi of L-[ 35 S]methionine (1.98 Ci/mmol) and 120 fiCi of 
[ Hjglucosamine (28.8 jiCi/mmol) (New England Nuclear, Boston). 
The fluid was harvested and processed as above. 

Purification of Radioactive Interferon— Rabbit anti-interferon se- 
rum of high specificity, raised against partially purified L cell inter- 
feron (18), and absorbed by various conceivable impurities (19), was 
used to purify the radiolabeled interferon, as will be described in 
detail elsewhere,' To the culture fluid from each dish were added 25 
^1 of the antiserum, and the mixture was incubated at 37°C for 15 
min and then at 4°C overnight. Goat anti-rabbit y-globulin (Fujizoki 
Pharmaceutical Co, Tokyo), 17.5 ul, was added, and after incubation 
at 37°C for 30 min and at 4°C overnight, the precipitate formed was 
collected by centrifugation at 18,000 x g for 20 min. After washing 
twice with 0.2 mt of NaCl/P, 2 containing 0.5% Triton X-100 and once 
with NaCI/Pi alone, the precipitate was dispersed by sonication in 0.2 
ml of 20 irai sodium phosphate buffer at pH 7.1 containing 0.15% 
SDS, 10 fiM dithiothreitol, 2 mjw EDTA, and 10% glycerol and 
incubated at 70°C for 30 min. Then, 0.2 ml of NaCl/P : was added and 
0.4 ml of 10% trichloroacetic acid was added at 4°C. The precipitate 
was collected by centrifugation and washed with 0.5 ml of 5% trichlo- 
roacetic acid at 4°C three times and with 0.5 ml of acetone at ~10°C 
three times to remove trichloroacetic acid. 

Polyacrylamide Gel Electrophoresis— SDS-polyacrylamide gel 
electrophoresis was performed as previously described (19), using 
12.5% gel to minimize the influence of the sugar moieties of glycopro- 
teins on their mobility. The precipitate of purified radioactive inter- 
veron was dissolved in 50 mM sodium phosphate buffer, pH 7 1 
containing 5% SDS, 0.1 mM dithiothreitol, 2 min EDTA, and 10% 
glycerol, heated at 70°C for 30 min, and electrophoresed at 1 mA/gel 
for 3 h and at 4 mA for 12 h. The distributions of interferon and 
radioactivity were determined on 2-mm slices of the gel, each of which 
was extracted at 4°C overnight with 0.2 ml of Eagle's minimal 
essential medium containing 5% calf serum. 

Interferon Assay— The antiviral activity of interferon was deter- 

1 S. Yonehara, Y. Iwakura, and Y. Kawade, manuscript in prepa- 
ration. 

* The abbreviations used are: NaCl/P^, phosphate-buffered saline 
(0.02 M sodium phosphate, pH 7.4, 0.14 m NaCl); SDS. sodium dodecyl 
sulfate. 
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mined by the 50% inhibition point of [ 3 H]uridine incorporation into 
vesicular stomatitis virus RNA in L cells as previously described {18, 
20). The titers in this report are expressed in international units. 

Leetin-containing Column — Wisstaria floribunda agglutinin, a d- 
galactose-specific lectin, covalently coupled to Sepharose, was sup- 
plied by Dr. Yoko Yamamoto of this laboratory, and used as described 
by her (15). 

RESULTS AND DISCUSSION 

When virus-induced L cells were incubated in the presence 
of tunicamycin (0.5 to 10 fig/ml), the interferon yields in the 
culture fluid varied considerably from one experiment to 
another, ranging from 20 to 100% of control without tunica- 
mycin. The reason for this variation is not clear, but regardless 
of whether or not the yield was reduced, a distinct change in 
molecular size was observed for interferon produced in the 
presence of tunicamycin. Thus, L cell interferon normally 
consists of a major component of 40,000 daltons (S interferon) 
and a minor one of 24,000 daltons (F interferon), as analyzed 
by SDS-polyacrylamide gel electrophoresis (18, 19). Treat- 
ment with tunicamycin (2 jug/ml) resulted in almost complete 
disappearance of both the S and F species and in formation of 
a new peak of interferon activity with 18,000 daltons, as shown 
in Fig. 1. We shall designate this species as T interferon. 

To make clear whether S, F, and T are glycosylated or not, 
we examined incorporation into them of [ M S]rnethionine and 
[ 3 H]glucosamine added to the induced cultures. The labeled 
interferons were purified from the culture fluid by immuno- 
precipitation and analyzed by SDS-gel electrophoresis. In Fig. 
la, the two peaks of the antiviral activity in the control 
preparation were accompanied by the radioactivity of both 
[^methionine and [ 3 H]glucosamine, indicating that both S 
and F interferons are glycoproteins. (The radioactive peak at 
about 45,000 daltons is probably an impurity, which was not 
always observed, especially when smaller amounts of the anti- 
interferon serum were used.) In contrast, the incorporation of 
[ 3 H]glucosamine into the interferon proteins was strongly 
suppressed relative to that of [ 33 S]methionine in tunicamycin- 
treated cultures, as shown in Fig. 16. It is concluded that the 
small interferon molecules (T) produced in the presence of 
tunicamycin contain little or no carbohydrate. The lower 
apparent molecular weight of T is consistent with this. Havell 
et al. (5), using 2-deoxyglucose or glucosamine as glycosylation 
inhibitors, also observed production of smaller interferon mol- 
ecules from human fibroblasts, but in their experiments, nor- 
mal size molecules were not completely eliminated even with 
an inhibitor dose that strongly suppressed the interferon 
yields. 

The binding properties of the interferons to a lectin from 
Wistaria floribunda also indicated impairment of glycosyla- 
tion due to tunicamycin. As shown in Fig. 2, control interferon 
was bound partly to the lectin and eluted with D-galactose 
followed by a weak acid, as previously reported (15), but 
interferon produced in the presence of tunicamycin was not 
bound at all. 

These results are in good harmony with the known mode of 
action of the antibiotic, and indicate that the carbohydrate 
moiety of interferon molecules is dispensable for the antiviral 
activity, a conclusion also derived using different means of 
reducing the sugar content (3-5). The data in Fig. 1 indicate 
that the specific activity of T interferon, based on the 
[ 35 SJmethionine radioactivity, was not lower than that of S 
and F molecules. Thus, tunicamycin appears to be a useful 
agent, in its specificity and completeness of action, to produce 
nonglycosylated and fully active interferon molecules. 

The antiviral activity of normal L cell interferon was essen- 
tially stable against heating at 70°C for an hour in the presence 
of SDS and a reducing agent. But, interferon produced in the 
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Fig. 1. SDS-polyacrylamide gel electrophoretic patterns of L cell 
interferon produced in the presence (6) and absence (a) of tunica- 
mycin. Interferon samples labeled with [^glucosamine and 
[^methionine, purified from the culture fluids by immunoprecipi- 
tation (see "Materials and Methods"), were run for determination of 
the radioactivities. Portions of the same culture fluids, after incuba- 
tion at 37°C for 20 min with added SDS (1%) and glycerol (20%), were 
electrophoresed for determination of interferon activity, a, control 
interferon. Recovery, 12% of interferon, 32% of 3 H, and 33% of 
radioactivities, b, interferon from cultures treated with tunicamycin 
(2 jig/ml). Recovery, 13% of interferon, 23% of a H, and 28% of 35 S 

radioactivities. O— O, interferon activity; x x, 3 H radioactivity; 

and A- - -A, M S radioactivity, The apparent molecular weights were 
determined from the mobilities of the following reference proteins 
run in a parallel gel: bovine plasma albumin (68,000), ovalbumin 
(45,000), chymotrypsinogen A (25,000), and myoglobin (17,000). 

presence of tunicamycin was inactivated by heating under the 
same conditions, as shown in Fig. 3a, indicating a substantial 
change in molecular characteristics due to the absence of 
sugar. 

However, in the absence of SDS, the rates of inactivation 
at 45°C with or without 0.5% Triton X-100 were not signifi- 
cantly different between control and tunicamycin interferons, 
as shown in Fig. 3, b and c. Thus, it seems unlikely that 
thermal inactivation of nonglycosylated interferon in the cul- 
ture fluid before being harvested was an important cause of 
reduced yields often observed in tunicamycin-treated cultures. 
As another possible cause of the yield reduction, we examined 
whether tunicamycin inhibited secretion of interferon to cul- 
ture fluid without affecting its synthesis. In preliminary ex- 
periments, however, no appreciable accumulation of intracel- 
lular interferon was found, suggesting that the carbohydrate 
moiety of the molecules probably plays no essential role for 
secretion. 
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...^ 2 ' Chromatography of interferons produced in the presence 
(o) and absence (a) of tunicamycin on Wistaria floribunda agglutinin 
coupled to Sepharose. Crude interferon samples, dialyzed against 
NaCl/Pi containing 20% glycerol, were charged to a column (1 ml bed 
volume) containing 1 mg of the lectin, and the unadsorbed material 
was eluted with the same buffer. Further elution was done with- El 
01 m D-galactose in NaCl/P s ; E2, 0.1 m acetic acid; and E3, 2 m NaCl' 
all containing 20% glycerol. Fraction volume, 1.5 ml. Recovery of 
interferon, 42% in a and 136% in b. 

The structural relationship among S, F, and T is an in- 
triguing question, and tunicamycin will be useful for clarifying 
it. We have found that purified S and F, upon tryptic digestion 
produce a number of chromatographicaliy identical peptides^ 
suggesting the presence of common structure in their poly- 
peptide moieties. 3 The present results are consistent with the 
notion that S, F, and T differ from each other merely in the 
sugar moiety, and S and F are both formed by glycosylation 
of T. But other possibilities are by no means excluded. Further 
studies are in progress to elucidate the relationship among the 
different interferon species and the biological and structural 
roles of the sugar moiety. 

Acknowledgments— We sincerely thank Professor Gakuzo Tamura 
of Department of Agricultural Chemistry, the University of Tokyo 
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Summary 

In this paper we describe a method for constructing 
E. coli plasmids that direct efficient expression of 
genes that encode eucaryotic or procaryotic pro- 
teins. No functional assays for the proteins are 
needed, and they are produced in their native, un- 
f used state. The only requirement is that the genes 
be isolable without intervening sequences. We de- 
scribe as an example the construction of a plasmid 
that directs the synthesis of about 10,000-15,000 
monomers per cell of rabbit /J-globin. The essential 
steps in a typical construction are as follows. 
— A region of the gene encoding the ammo-terminal 
portion of the protein is fused to DNA encoding an 
enzymatically active carboxy terminal fragment of 
/j-galactosidase. The latter is carried on one of 
three plasmids designed to facilitate the fusion (the 
construction of these three plasmids is described 
in the Appendix). 

— A "portable promoter" of the lac operon is placed 
at many positions in front of the fused gene using 
nucleases in vitro. Those promoter placements that 
elicit efficient expression of the fused gene are 
identified by the /t-galactosidase activity that they 
express. (In the special case we describe, plasmids 
identified as directing efficient expression of /?- 
globin were found to bear "hybrid" ribosome bind- 
ing sites consisting of the Shine-Dalgarno sequence 
carried on the promoter fragment and the ATG of 
the /J-giobin gene.) 

— The gene of interest is reconstituted intact, with 
the portable promoter in place, by recombination in 
vitro or in vivo. 

Introduction 

One current goal of recombinant DNA research is to 
maximize the expression of procaryotic and eucar- 
yotic genes cloned on plasmids of the bacterium E. 
coli. Our methods, previously reported and extended 
in this paper, have been directed toward producing 
large amounts of the products in their native states, 
that is, unfused to other proteins. In this regard, our 
approach differs from that of some others who have 
described methods for producing proteins that are 
hybrids of procaryotic and eucaryotic amino acid se- 
quences {for example see Fraser and Bruce, 1978; 
Mercereau-Puijalon et al., 1 978; Seeberg et ai., 1 978; 



Vitla-Kamaroff etal., 1978). Our techniques were orig- 
inally developed to produce large amounts of two 
regulatory proteins encoded by phage A, the repressor 
and era product (Backman and Ptashne, 1978; Rob- 
erts and Lauer, 1 979; Roberts, Kacich and Ptashne, 
1979a). More recently these methods have been ex- 
tended to produce large amounts of other procaryotic 
regulatory proteins (G. Lauer, A. Poteete and R. Brent, 
unpublished data). We have also described the pro- 
duction of a native eucaryotic protein in E. coli, the 
SV40 t antigen (Roberts et al., 1 979b). 

The essential features of our previously described 
methods were based upon the following considera- 
tions. Expression of a cloned gene requires that it be 
efficiently transcribed and that the mRNA so produced 
be efficiently translated. Transcription depends upon 
a promoter (such as the lac promoter) positioned in 
front of the gene. Efficient translation of a message in 
bacteria depends at least in part upon the presence in 
the mRNA of a "ribosome binding site." This site is 
thought to consist of a sequence at least 3-9 bp long, 
called the Shine-Dalgarno (SD) sequence, followed 
3-1 1 bp away by the AUG which encodes the amino 
terminal methionine of the protein (Shine and Dal- 
garno, 1975). The SD sequences are complementary 
to the 3' end of the 16S rRNA and probably promote 
binding by duplexing with that RNA (Steitz, 1979). 
The effects of sequences upstream from the SD se- 
quence in the leader or downstream from the AUG in 
the gene are less clearly understood. It is known, for 
example, that the base following the AUG can influ- 
ence binding to the ribosome (Taniguchi and Weiss- 
mann, 1 978). We have shown that, in two cases, the 
sequence of the leader can have a dramatic effect on 
translation (Backman and Ptashne, 1 978; Roberts et 
al., 1979a). 

These considerations led to the following strategies. 
We used a DNA fragment (a "portable promoter") that 
encodes the promoter of the lac operon and 35 bp of 
lac mRNA. This mRNA includes the lacZ SD sequence 
and the next five nucleotides but does not include the 
AUG that encodes the amino terminal methionine of 
/J-galactosidase. Using the appropriate nucleases, we 
positioned this portable promoter at many different 
places in front of the gene. For example, when posi- 
tioned 9 or 1 1 bp in front of the small f gene of SV40 
cloned on a plasmid, the plasmid directed the synthe- 
sis of relatively large amounts of small t protein. In 
each of these cases, we had formed a DNA fusion that 
encodes a hybrid ribosome binding site consisting of 
the SD sequence from the lac operon and the AUG 
encoded by the f gene. A similar hybrid ribosome 
binding site was used to express the A repressor 
efficiently (Backman and Ptashne, 1978); other inves- 
tigators used DNA synthetic techniques to express 
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the case of the X cro gene, the most efficient expres- 
sion was achieved using the cro gene ribosome bind- 
ing site. Even in the latter case, the distance of the 
promoter fragment from the gene profoundly influ- 
enced the level of protein produced (Roberts et af., 
1979a). 

An important limitation of our method described 
previously is the requirement for a functional or im- 
munological assay to detect those clones that had the 
promoter positioned at the optimal distance from the 
initiator ATG of the gene to be expressed. In some 
cases, identifying such clones may be laborious or 
impossible. (For example, affinity-purified antiserum 
to rabbit £-globin cross-reacts in a solid phase ra- 
dioimmune assay with at least one E. coli protein.) 
The chief advance we describe here enables us to 
express at high efficiency procaryotic or eucaryotic 
genes in the absence of assays for their gene prod- 
ucts. The procedure requires knowledge of the ap- 
proximate location of the ATG encoding the amino 
terminal methionine and the absence of intervening 
sequences in the gene to be expressed. In this paper, 
we describe these methods and show a particular 
application to the rabbit /8-globin gene. 

Results 

Experimental Design 

We wished to design a method whereby we could 
position our portable promoter fragment at an optimal 
distance from the 5' end of a gene in the absence of 
any assay for the gene product. We anticipate that in 
the case of eucaryotic genes, the optimal fusion 
would, in general, encode a hybrid ribosome binding 
site consisting of an SD sequence from the promoter 
fragment and the ATG from the gene. In the case of 
procaryotic genes, it may or may not be necessary to 
create such a hybrid ribosome binding site (compare 
Backman and Ptashne, 1978 and Roberts et al., 
1 979a). We have developed a method that uses lacZ 
gene fusions to enable us to identify readily clones 
bearing the promoter at the optimal distance from the 
gene. (See Bassford et al., 1978 and Ptashne et al., 
1 980 for examples of previous use of lacZ fusions.) 

We have designed a series of plasmids (pLG200, 
pLG300 and pLG400; see Appendix) each of which 
contains a carboxy terminal region of the lacZ gene 
fused to a fragment of the lacl gene. The lacZ frag- 
ment, which comprises most of the gene, encodes a 
protein fragment that has jS-galactosidase activity. 
This activity is not affected by additional protein se- 
quences at its amino terminus (see below). The l-Z 
fused gene is not expressed in these plasmids be- 
cause the lacl promoter and the end of the / gene 
have been deleted. Each of the plasmids bears a 
unique restriction site in the lacl portion of the lacl- 
lacZ fusion. When cut with the appropriate restriction 
enzyme, the lacl-lacZ hybrid gene is exposed in a 



different translational reading frame in the three plas- 
mids. 

Figure 1 outlines the three steps required to express 
efficiently a gene (called X) using one of the plasmids 
described above and in the Appendix. First, we insert 
a fragment of DNA that encodes the amino terminal 
region of the X protein into one of the pLG plasmids. 
The particular pLG plasmid used is chosen so that the 
translational reading frames of X and lacZ are in 
register (Figure 1 a). At this stage, the fused gene is, 
in general, neither transcribed nor translated. Second, 
using restriction digestion and appropriate nucleases, 
we position the portable promoter at varying distances 
from the ATG encoding the amino terminal methionine 
of the X protein (Figure 1 b). This procedure is facili- 
tated by the presence of a unique restriction site near 
the 5' end of the gene. If such a site is not present 
naturally, it can easily be introduced (Roberts et al., 
1 979a). Those fusions bearing the promoter at optimal 
distances from the amino terminal ATG are recognized 
as follows. Lac" bacteria are transformed with plas- 
mids bearing the portable promoter at varying dis- 
tances from the gene. Bacterial colonies, now Lac + , 
bearing the plasmids that direct the synthesis of the 
enzymatically active hybrid protein, are recognized by 
their color on the appropriate indicator plates. The 
intensity of this color is a rough measure of the relative 
efficiency of production of the hybrid protein. The 
third step in the construction is to reconstitute intact 
X by substituting for the lacl-lacZ portion of the fusion 
the carboxy terminal region of X (Figure 1 c). This can 
be done by using recombination in vitro or in vivo to 
generate the gene. The product of the reconstituted 
gene is readily visualized on gels using the maxi-ceil 
technique (Sancar, Hack and Rupp, 1979). 

A Bacterium Producing Rabbit jS-Globin 

Our source of the rabbit /J-globin gene was the plas- 
mid p/?G1 (Maniatis et a!., 1976), which encodes 
rabbit /?-globin cloned as a cDNA copy (see Figure 
2). We excised the gene using S1 nuclease and 
ligated Hind 111 linkers (Bahl etal., 1976; Shelleretal., 
1 977) onto the excised fragment. We then cut the 
fragment with Hind III and Bam HI (which recognizes 
a naturally occurring Bam HI site roughly two thirds of 
the way into the gene) and isolated a 330 bp amino 
terminal fragment containing 22 bp of the /?-g!obin 

"leader sequence and 301 bp of the ,6-globin coding 
sequence. This fragment was cloned into pBR322 
(Figure 2a) to form pGL6 and re-excised on a larger 
fragment by cleavage with Pst I and Bam HI (Figure 
2b). The latter fragment was then inserted into 
pLG300, producing a /?-globin-/acZ fusion. This step 
of the construction joined two Pst I ends and two Bam 
HI ends that had been rendered flush (Backman, 
Ptashne and Gilbert, 1 976). The preliminary cloning 
of the ^8-globin fragment into pBR322 facilitated the 
cloning into pLG300 because joining of the Pst I ends 

j Assigned Page #2125 
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a. I. Open pt-G at restriction site a 

2. Insert amino terminal fragment of gene X to make fused gene 




f=51 t^ p M . 
t T 

Pst PvaM 



b f. Open pLG' with restnction enzyme b 
£ Resect with nucleases for varying distances 

3. Cut pt_£ with f%f and insert Pst-PvuE promoter fragment 

4. Transform and select Lac* 




C I. Cut pLG" with e and d 

2. Replace c-d fragment with c-e fragment 1o reconstitute X 




Figure 1 . A General Method to Maximize 
Expression in E. coli of Any Procaryotic or 
Eucaryotic Gene 

(a) A fragment of DMA bearing the amino ter- 
minal region of gene X is inserted into restric- 
tion site a of a pLG plasmid, thereby fusing 
gene X to lacl-Z. In general, this fusion will 
occur by ligation of ends which have been 
rendered flush by ONA polymerase I {Backman 
etal., 1976). 

(b) Plasmid pLG' bearing the fused gene is 
opened at a unique restriction site b which 
precedes the ATG of the fused gene. Resec- 
tion and insertion of a portable promoter frag- 
ment that has a Shine-Dalgarno (SD) sequence 
is performed as described (Roberts et ai., 
1979a). Transformed clones that bear plas- 
mids that direct the synthesis of high levels of 
/?-galactosidase are identified as Lac + colo- 
nies on the appropriate indicator plates. 

(c) Gene X is reconstituted from plasmids that 
direct the synthesis of high levels of £-galac- 
tosidase as a fused product. This can be car- 
ried out, for example, by digesting the hybrid 
gene plasmid at any site (c) present in the 
gene X portion of the hybrid and at another 
site (d) in lacZ. Gene X is then reconstituted 
by the insertion of a DNA fragment that con- 
tains the carboxy terminal region of gene X 
extending from site c to a site <e) past the end 
of the gene. Gene reconstitution can also be 
achieved by in vivo genetic recombination. 
Note: Plasmids are not drawn to scale in this 
or subsequent figures. 



regenerates the amp gene. The structure of the re- 
sulting plasmid, pLG302, was confirmed by restriction 
enzyme analysis. 

The portable promoter was positioned at varying 
distances from the £-globin ATG as follows (Figure 
2c). pLG302 was cut with Hind III and resected with 
exonuclease III and S1 (Roberts et al., 1 979a) in some 
cases and with Bai 31 (Gray et al., 1975) in others, 
and the reaction products were cleaved with Pst. An 
excess of the portable promoter fragment bounded on 
one end by a Pst site and on the other by a flush Pvu 
II end was added, and the mixture was ligated. [This 
portable promoter is about 850 bp long. The lac SD 
sequence is 5 bp from the Pvu II end and is adjacent 
to the lac promoter as previously described (Backman 
and Ptashne, 1 978; Roberts et ai., 1 979a). Part of the 
amp gene is carried on the fragment, and joining g f oarc | 



the Pst ends of this fragment and pLG300 regenerates 
amp resistance. This promoter fragment is isolated 
from pGL101 (G. Lauer and M. Ptashne, unpublished 
results).] To identify those plasmids bearing the pro- 
moter at optimal distances from the 0-gJobin ATG, a 
Lac - strain of E. coli was transformed with the mixture 
and plated on lactose MacConkey indicator plates 
containing 15 fig/ml ampicillin. Colonies producing 
0-galactosidase as indicated by their characteristic 
red color on the indicator plates were picked and 
assayed for the enzyme. Clones producing interme- 
diate levels of /3-galactosidase [about 300-700 units 
(Miller, 1972)] are pale red, and those producing high 
levels (—1000 units) are dark red under these condi- 
tions (see Figure 2 legend). (We describe below and 
in Figure 2d the reconstitution of the jS-globin gene.) 

^ffieWSffi&fl®™*** directing high levels of 
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I Erase £-gjotin gene with SI and attach HindE linkers 
2. Cut exdsed fragment with Botn and HindM 

3 Clone amino terminal portion between HintfW and Bam sites in pBR322 




Cut pGL6 and pl£3O0 with 8am ond fill in ends 

2 Cat both ptemids with Pst 

3 Insert Pit-Bam fragment from pfi.6 into PLG30O 



Figure 2. Construction of Plasmids That Di- 
rect the Synthesis of High Levels of Rabbit 
/J-Globin in E. coli 

(a) Plasmid p/iG1 contains a cDNA fragment 
containing the rabbit /J-globin gene with a 5' 
leader region of 56 nucleotides and a 95 nu- 
cleotide region to the 3' side of the gene 
<Efstratiadis, Kafatos and Maniatis, 1977), 
which was cloned by the dA.dT tailing method 
(Jackson, Symons and Berg, 1972; Maniatis 
et al., 1976). We excised this fragment by 
selective degradation with S1 nuclease (Boer- 
inger, 50,000 U/ml) as described (Hofstetler 
et al., 1 976) and isolated a 525 bp fragment 
from a gel. Hind III linker oligonucleotides 
(Sheiler et al., 1977) were Hgated onto the 
ends of the fragment, and the molecule was 
cut with Hind. Ill and Bam HI. Bam HI cuts 298 
bp into the j8-globin coding sequence. A 330 
bp fragment containing a Hind ill end, a 22 bp 
leader region (as determined by DNA se- 
quencing) and 301 bp ot the /?-globin coding 
sequence were isolated from a gel and ligated 
with a purified pBR322 backbone fragment 
that had a Sam HI end and a Hind III end to 
yield pGL6. 

(b) pGL6 and PLG300 (see Appendix) were 
separately digested with 8am HI and the ends 
were rendered flush (Backman et al„ 1 976), 
and then both were digested with Pst I. A 
purified fragment containing the amino termi- 
nal region of 0-globin was ligated to a purified 
PLG300 fragment to construct the /J-globin- 
lacZ fusion and regenerate the amp gene. 
The resulting plasmid is pLG302. 

(c) pLG302 was opened by Hind HI, resected 
with Exo III and S1 (Roberts et al., 1979a) or 
Ba! 31 (Gray et al., 1975) and then digested 
with Pst I. Resection was monitored by exam- 
ining products at various times by electropho- 
resis on 3.5% acrylamide gels. A purified lac 
promoter fragment from pGL101 with a Pst I 
end and a Pvu II flush end was added in 
excess, and the mixture was ligated. An E. coli 
strain (NK5031) that bears a deietion of lacZ 
and synthesizes low levels of lac permease 
was transformed with the mixture and plated 
on lactose MacConkey indicator plates. 
Clones synthesizing ~300 units (Miller, 1 972) 
of ^-galactosidase are red on these plates. 
Plasmids from Lac* clones were purified and 
the DNA sequence around the start of the 
gene was determined. 

(d) A fragment with an Rl and Sau 3A end 
containing the lac promoter and the amino terminal two thirds of the jS-globin gene was purified from representative plasmids produced in the 
previous step. This fragment was ligated to a fragment with a Sau 3A end and an Alu I end containing the carboxy terminal third of (S-globin 
(isolated from p/3G1 ). Joining of the Sau 3A ends regenerates an intact /3-gfobin gene. A pBR322 fragment with an Rl end and a Pvu II end is also 
present in the ligation mix. A trimolecular ligation was performed which generated a pBR322 derivative containing an intact jS-globin gene 
preceded by the lac promoter fragment. 




Open pLG3C2 with HindM 
2 Resect with nucleases 

3. Cut with Pst and insert Pst-Pvul promote' fragment 
4 Transform and select Lrx* 





£'£-gobin | — | 
TulSa Atul 



1 Purify RI-SasfiG amino terminal £-glabin gene fragment from pi_G30£-l series 

2 Purify Soufflo-Aiul carbcwyl terminal /3-gloDin gene fragment from p>SC| 

3 Reconstitute ^-gtobin gene between PI and PvuH sites on pBR3?2 in a trimoleculrj' ligation 




/3-galactosidase synthesis fell into two classes. One 
class was generated by Bal 31 or Exo III + S1, 
appeared at a frequency of about 3% among trans- 
formed clones and produced about 1200 units of /?- 
galactosidase (for example, pl_G302-2). The second 
class was generated only by Exo III + S1, appeared 
at a frequency of about 0.5% and produced about 
2000 units of /S-galactosidase {for example, pLG302- 
3). This corresponds to 1 0,000-20,000 monomersP3g arc | 



protein per cell (I. Zabtn, personal communication). 
The DNA sequence was determined around the amino 
terminal coding region of six recovered plasmids. 
Three directed synthesis of high levels of /3-galacto- 
sidase (two from the first and one from the second 
class), and three directed synthesis of lower levels of 
j8-galactosidase. Figure 3 shows the relationship be- 
tween the position of the portable promoter and the 
A%'S®j^<PPfa^W2 i [%fc iase produced. The three hav- 
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Figure 3. DNA Sequence around the Start of the Hybrid Gene after Various Positionings of the Portable Promoter Fragment 
Depicted from left to right are the end of the lac promoter fragment (dashed line), the j8-globin leader region, the first 15 codons of the 0-globin 
gene and the distal iacZ fragment. In boxes are the Shine-Dalgamo (SD) sequence present on the promoter fragment and the ATG encoding the 
amino terminal methionine of £-globin. Arrows indicate the position of the lac promoter fragment in various plasmids, and the number ot units of 
/5-galactosidase directed by the plasmids is also shown. pLG302-1 was obtained by digesting pLG302 with Hind III, filling in the cohesive ends 
with DNA polymerase I and inserting the promoter fragment as described (Figure 2). The remaining plasmids were obtained by digesting pLG302 
with Hind III. resecting with Exo III and St (pLG302-3) or Bal 31 (pLG302-2, -29, -30, -34) and inserting the promoter fragment. Each sequence 
was determined by first digesting the plasmid with Rl and extending the exposed 3' ends with DNA polymerase and 32 P-labeled deoxyribonucleotide 
triphosphates. The DNAs were then cut with Pvu !l. which cuts in the lad interstitial fragment. The Rl Pvu II fragment bearing the lac promoter and 
the /3-globin gene fragment was isolated and sequenced. 



ing the highest /i-galactosidase levels (pLG302-2, -3 
and -29) ail had hybrid ribosome binding sites, and 
two of these had identical sequences. Of the other 
three, two had a large separation between the pro- 
moter fragment and the ATG, and one had been 
digested into the coding sequence. We have not de- 
termined where protein synthesis initiates in these 
latter three cases. 

The hybrid proteins encoded by the derivatives of 
pLG302 identified as described above were visualized 
by polyacrylamide gel electrophoresis (not shown). 
For example, when compared with the parental strain, 
a strain harboring pLG302-2 contained an additional 
protein of approximate molecular weight 130,000. 
This is in good agreement with the predicted molecular 
weight of a /3-globin-jS-galactosidase hybrid protein. 
In addition, the level of hybrid protein seen on the gel 
was roughly that expected for a strain making 1 200 
units of /?-galactosidase, assuming that the specific 
activity of the hybrid protein is roughly equivalent to 
that of wild-type ^-galactosidase. 

The intact giobin gene carrying the lac promoter at 
different distances from the start of the gene was 
reconstructed from plasmids bearing hybrid genes as 
shown in Figure 2d. A fragment carrying the lac pro- 
moter and the 5' portion of /8-globin was excised by 
Ri and Sau 3A. (note that Sau 3A cuts at the same 
position at which /8-globin was joined to pLG300.) 
This fragment was ligated to another containing the 3' 
end of the /?-globin gene bounded by a Sau 3 A site 
and an Alu site in the presence of a pBR322 fragment 
bounded by Rl and Pvu II ends. This three-piece 
construction requires two sticky end (RI-RI and Sau 
3A-Sau 3A) ligations and one flush end (Alu-Pvu II) 
ligation, and reconstitutes the /8-globin gene. Follow- 
ing ligation, amp-resistant transformants were se- 
lected (Backman et al., 1976), and following piasmid 



re isolation the construction was confirmed by restric- 
tion analysis. This j8-globin reconstruction was per- 
formed with three different plasmids (pLG302-1, 
pLG302-2, pLG302-3; see Figure 3) which bore the 
promoter at different positions and which synthesized 
different amounts of the hybrid j8-globin-/8-galacto- 
sidase protein as determined by enzyme assay. 

The production of native /?-globin in cells bearing 
these three plasmids was demonstrated in two ways. 
First, the maxi-cell technique (Sancar et al., 1979) 
showed that all these plasmids directed synthesis of 
a protein the size of ^-giobin (Figure 4). The amount 
of the presumed /i-globin made in each case was 
estimated by comparing, on polyacrylamide gels, the 
amount of radioactivity incorporated into the protein 
with that incorporated into a standard (see Figure 4 
legend). The amount of /8-globin made by the highest 
producer (pLG302-3) was estimated in this way to be 
about 1 0,000-1 5,000 monomers per cell, and that of 
the second highest producer (pLG302-2, -29) was 
estimated at about 5000-7500 monomers per cell. 
These values agree well with those predicted by the 
/S-galactosidase activities directed by the parental 
fusion plasmids. The /3-globin plasmid reconstructed 
from a low-level synthesizer of /S-galactosidase 
(pLG302-1) produced no detectable 0-globin. Sec- 
ond, a partial amino acid sequence of the protein 
encoded by one of the plasmids corresponded with 
that of authentic rabbit /?-globin (Dayhoff , 1 969). The 
protein was labeled with 3 H-leucine in one case and 
with 35 S-methtonine in another, extracted from gels 
and subjected to 35 cycles of Edman degradation 
(Figure 5). In rabbit 0-globin, there is no amino ter- 
minal methionine, and leucines are found at positions 
3, 14, 28, 31 , 32 . . . .In the labeled protein, leucines 
were found at positions 4, 15, 29, 32 and 33, and a 

rff^^r^m^Mtlg 081110 " 1 • This result shows 
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Figure 4. Polyacrylamide Gel Electrophoresis of Proteins Encoded 
by Plasmids Containing the 0-Globin Gene and Various Placements 
of the Portable Promoter Fragment 

Plasmid proteins were specifically labeled with 3 H-leucine by the 
maxi-cell technique (Sancar et al., 1979) in strain RB1 13 and ana- 
lyzed by PAGE according to the method of Laemmli (1970) on 1 3.5% 
acrylamide gels. The position of a rabbit ^-globin standard is shown. 
(1) PLG302-1R (reconstituted from PLG302-1); (2) pLG302-2R; (3) 
PLG302-3R; (4) pTR122. pTR122 makes about 40,000 monomers 
per cell of cro as determined by radioimmunoassay (Roberts et al., 
1979a; T. M. Roberts, unpublished data). The number of monomers 
of jg-globm made from various pLG derivatives was estimated by 
densitometry tracing of the autoradiogram and comparison with 
PTR122. The estimates are 5000-7500 monomers per cell for 
PLG302-2R and 10,000-15,000 monomers per cell for pl_G302-3R. 

that the protein is rabbit /3-globin plus an amino 
terminal methionine which is not removed in E. coli. 

Discussion 

The methods described here and in a preceding paper 
(Roberts et al., 1979b) enable us to place a portable 
promoter fragment at many different distances in front 
of a gene and to recognize readily those placements 

Board 
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Figure 5. Position of Leucines and Methionines in the Amino Terminal 
Region of Bacterial /i-Globin 

After maxi-cell labeling with either 3 H-Leu or 3S S-Met, the 14 kilo- 
dalton protein encoded by PLG302-2R was extracted from the gel 
and analyzed by automated Edman degradation. The product of each 
cycle was counted in a liquid scintillation counter. The small peaks of 
H-Leu at positions 9, 1 4 and 1 8 are presumably due to the presence . 
of a contaminating protein in the gel eluent. 

which elicit efficient transcription and translation of 
the gene. The method requires no assay for the gene 
product and can readily detect those plasmids bearing 
the promoter fragment at an optimal distance from the 
gene even if they occur rarely. For example, in the 
case we describe here, we positioned the portable 
promoter at varying distances in front of the ATG 
encoding the amino terminal methionine of rabbit p- 
globin. We immediately identified on the basis of /?- 
galactosidase activity a small fraction (0.5%) that we 
suspected had an optimal promoter placement. In 
these cases, we confirmed that the plasmid directed 
the synthesis of high levels of rabbit jS-globin when 
the gene was reconstituted. It is striking that our most 
efficient fusions selected only on the basis of jS-ga- 
lactosidase activity have positioned DNA encoding the 
Shine-Dalgarno (SD) sequence close to (8 and 1 0 bp) 
the ATG at the start of /5-globin. These results may 
be taken as further support for the idea of a ribosome 
binding site as first proposed by Shine and Dalgamo 
(1 975). 

We do not yet have a systematic survey of the effect 
of varying the distance between the SD sequence and 
the ATG; nor do we know the effect of varying the 
sequence of the interstitiai bases. Our strain produc- 
ing the highest levels of /?-globin makes about 
10,000-15,000 molecules per cell. This is about an 
order of magnitude less than that expected on the 
basis of the known strength of the lac promoter and 
the number of copies of Col E1 plasmids in cells. (See 
Backman and Ptashne, 1 978 for an argument that the 
lac promoter works as efficiently on a Col E1 plasmid 
as it does on the chromosome.) It remains to be seen 
whether sequences in the leader or within the coding 

signed Page #2129 
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region of the gene are limiting expression. We have 
begun an analysis of mutations that increase synthesis 
of /J-galactosidase from plasmids that encode hybrid 
genes (such as ^-globin-tecZ). We do not yet know 
which portion of the plasmid sequence or protein 
synthetic machinery these mutations will change. 

Our method wiil apparently work for expressing any 
procaryotic or eucaryotic protein beginning with its 
amino terminal methionine. In addition, our method 
may express shortened forms of the protein beginning 
at internal initiator codons. Finally, synthetic initiator 
codons can be attached within the gene to direct 
translation initiation, as has been reported in a pre- 
vious case (Qoedde! et at., 1979). In the two cases 
we have analyzed (yS-globin and SV40 1 antigen), the 
amino terminal methionine is not removed in E. coli 
(Roberts et al„ 1979b). 

Experimental Procedures 

Bacterial Strains and Enzyme Assay 

E. coli strains LG75 (F", facZvAsse). NK5031 (F", /acZAM5265, sulll 4 ". 
na/ R ), LG90 (F", A/aepraXltl), MM294 (endol rr, m K *) and RB1 13 
(F", recAl 3, uvrA6, thiA , thrl , leuBB, proA2, argEZ, his4, mtll , xyi5, 
tsx33, stiA3 1 . gaf2, ara 1 4, lac V\ ) were used. Assay of /3-galactosid- 
ase was performed as described (Miller. 1 972), except that minimal 
media were supplemented with 0.3% casamino acids. 

DNA Constructions 

Restriction enzyme digestions and use of DNA polymerase I and T4 
DNA ligase were as described (Backman et al., 1976), except that 
DNA polymerase I reactions were performed in the presence of 500 
pM dXTPs and ligations were performed in the presence of 600 
ATP. Use of exonuclease lit and S1 was as described (Roberts and 
Lauer, 1979). To monitor digestion from the Hind III site of pLG302, 
an aliquot of DNA was subsequently cleaved with Hint t, which cuts 
1 30 bp into the /?-gtobin coding sequence. Analysis of this digest on 
3.5% acrylamide gels allowed the extent of exonuclease III digestion 
to be approximated to an accuracy of about 1 0 bp. Bal 31 exonucle- 
ase digestion was performed as described (Gray et al.. 1 975), except 
the reaction was carried out at 20"C and 0.2 M NaCI in a 60 fil 
reaction volume containing 15 of DNA. Under these conditions, 
the enzyme removed about 1 0 nucleotides per mirt. The reaction was 
stopped with phenol. DNA sequencing was performed by the method 
of Maxam and Gilbert <1977). Synthetic DNA linkers were from 
Collaborative Research; Hind III linkers used in the pGL6 construction 
were a gift from R. Sheller. 

Radiolabeling of Proteins 

Plasmid-encoded proteins were labeled in strain RB1 1 3 by the maxi- 
cell technique (Sancer et al., 1979). Use of 35 S-methionine was as 
described (Brent and Ptashne, 1 980), while 3 H-!eucine labeling was 
performed as follows. A 10 ml culture of cells was grown, irradiated 
and shaken overnight as in the case of 3S S-methionine labeling. Cells 
were then spun and washed in M9 minimal medium containing all the 
amino acids except leucine. Cells were shaken in this medium for 1 
hr at 37°C (typically in a 1 ml volume), and 100 jiCi 3 H-leucine (13S 
Ci/mmole) were added. Labeling with 3= S-methionine or 3 H-leucine 
was performed for 1 hr, and cells were spun and resuspended in 
sample buffer (Laemmli, 1 970). Labeled extracts were run on 1 3,5% 
acrylamide gels for analysis (Laemmli, 1 970). 

Automated Amino Acid Sequence Analysis 

A 14 kilodalton protein was eluted from a 13.5% acrylamide gel 
(1 6,000 cpm of 3 H-leucine-labeled material and, separately, 2000 
cpm of 35 S-methionine-labeied material) and analyzed in a Beckman 
890 B automated sequencer using a 0.1 M Quadroi program ' E3r g^^|- ( j 



Margolies and Haber, 1 975). The radioactivity released at each cycle 
was measured in a liquid scintillation counter. Unlabeled apomyoglo- 
bin (200 fig) was added to the sample as an internal standard. 

Appendix: Construction of Plasmids for Making Iac2 Gene 
Fusions 

Leonard Guarente and Mark Ptashne 
Strategy 

We wished to construct plasmids that would facilitate fusing a DNA 
fragment encoding the carboxy terminal portion of lacZ with any DNA 
fragment encoding an amino terminal portion of another gene. The 
hybrid protein was to express j8-galactosidase activity. To do this, 
we exploited previous studies of lacZ gene fusion as follows. The first 
41 amino acids of jS-galactosidase may be removed without affecting 
enzymatic activity (Brickman et al., 1979). For example, one gene 
fusion was found to contain a large fragment of the lacl gene (355 of 
360 codons) but was missing the first 23 codons of lacZ (Figure 6; 
Brake et al., 1978). The encoded hybrid protein possessed both fac 
repressor and /i-galactosidase activities (Muiler-Hill and Kania, 1 974). 
We modified this lacl-lacZ hybrid gene so that DNA fragments en- 
coding the amino terminal portions of other proteins could be inserted 
precisely into the lacl portion. If the translational reading frame of the 
inserted gene fragment is in register with the lacl-lacZ fragment, a 
tripartite gene is formed. If this gene is transcribed and if translation 
initiates at the amino terminal codon of the inserted gene fragment, 
a tripartite protein is made consisting of the product of the inserted 
gene fragment at the amino terminus, a sequence of amino acids 
encoded by lacl and an active 0-galactosidase moiety at the carboxy 
terminus. Our plasmid constructions, described in detail below, con- 
sisted in outline of two steps. First, we constructed a plasmid con- 
taining the lacl-lacZ hybrid gene. Second, we replaced DNA encoding 
the promoter and amino terminus of the lacl gene with a variety of 
DNA linker oligonucleotides bearing sites sensitive to restriction en- 
zymes. When cut with the appropriate enzymes, each of the plasmids 
exposes the lacl-lacZ gene in a different translational reading frame. 
This enables us to insert a fragment of DNA encoding the amino 
terminus of any protein to form a tripartite hybrid gene which encodes 
a protein with jS-galactosidase activity. 

Plasmid Construction 

— (See Figure 6.) A DNA fragment containing an intact lacl-lacZ 
region was cloned into pBR322. As shown in the figure (a and b), this 
took place in two steps. /S-galactosidase production from strains 
bearing the resulting plasmid, pLG2, was induced 15 fold by the 
addition of 10~ 3 M IPTG to the growth medium. [IPTG (isopropyl- 
thiogalactoside) is a nonmetabolizable inducer of the lac operon.] A 
DNA fragment including the end of lacl and the beginning of lacZ was 
deleted from pLG2, thereby forming a lacl-lacZ hybrid gene. The 
deletion was introduced by genetic recombination between a lacZ 
derivative of pLG2 and an F factor carrying the 471 -56-1 4 lacl-lacZ 
fusion (see Figure 6; Brake et al., 1 978) (Figure 6c). Clones carrying 
pLG5 were constitutive for production of /J-galactosidase, which is 
expressed by transcription initiating at the lacl promoter and transla- 
tion initiating at the lacl amino terminal methionine codon. 
— (See Figure 7.) Convenient restriction sites were introduced into 
the lacl portion of pLG5 as synthetic DNA linkers (Sheller et al.. 
1 977). We introduced the 1 0 bp Hind III linker into a Pvu li site in lacl 
(Figure 7a). When the resulting plasmid, pLG200, is cut with Hind 111 
and the cohesive ends are filled in with DNA polymerase, the DMA 
encoding a portion of lacl and the carboxy terminal region of lacZ are 
exposed in a particular translational reading frame. Insertion of an 
amino terminal gene fragment with a transcriptional initiation site and 
a translational initiation site in the appropriate reading frame with 
respect to the lacZ gene fragment will result in the synthesis of /!- 
galactosidase as part of a tripartite polypeptide. To accommodate 
any of the three possible reading frames specified by a given amino 
terminal gene fragment, two additional derivatives of pLG200 were 
constructed. PLG200 was cut with Hind HI, the cohesive ends were 
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Figure 6. Construction of lacZ Fusion Plas- 
mids 

(a) Cloning a DNA fragment encoding the car- 
boxy terminus of /J-galactosidase. The lacZ 
transducing phage X1 1 2 (Maurer, Meyer and 
Ptashne, 1980} was digested with Kpn I, 
treated with S1 nuclease to give blunt ends as 
described (Roberts and Lauer, 1 979) and then 
treated with Eco RI. The fragment containing 
the carboxy terminus of lacZ was purified and 
ligated with a purified fragment of pBR322 
DNA containing RI and Pvu II ends. The ligation 
mix was used to transform strain LG90 (F~, 
Mac) to ampicillin resistance (Amp"). Plasmid 
DNA was isolated from several clones, and 
one containing the carboxy terminal fragment 
of lacZ (pLG1 ) was identified by analysis with 
restriction enzymes. 

(b) Cloning complete lad and lacZ. An RI 
fragment containing lacl, lacP and the amino 
terminal region of lacZ was purified from 
pMC4 (Calos, Johnsrud and Miller, 1978). 
This fragment was ligated with pLG1 DNA 
which had been cut with RI, and the ligation 
mix was used to transform LG90 to Amp" on 
plates containing XG and 10"* M IPTG. Blue 
colonies were purified, and plasmid DNA from 
one (pLG2) contained an intact tact-lacP-tacZ 
region. 

(c) Introduction of the lact-lacZ fusion. A 
lacZ' mutant of pLG2 was isolated by the 
phenotype of a colony bearing the plasmid on 
the indicator XG. (XG is a noninducing sub- 
strate for 0-galactosidase that is hydrolyzed 
to form the blue dye indigo.) This mutant 
proved to have acquired an IS1 insertion in 
lacZ (data not shown). The lacl-lacZ hybrid 
gene, hybrid 71 -56-1 4, was then crossed onto 
the lacZ' plasmid by recombination in vivo. 
This was accomplished by introducing 
pLG2facZ~ into a strain bearing F'facA71-56- 
14. After many generations of growth, the 
plasmid was reisoiated and introduced into a 
LacZ" strain LG90, Amp R JacZ + transformants 
were recognized on plates containing ampicil- 
tin and XG. The presence of the hybrid gene 
on a plasmid (pLG5) derived from one such 
clone was demonstrated by a 200 bp reduc- 
tion in the size of a Bel I fragment that spans 
the lacl-lacZ region. The hybrid gene has both 
lac repressor and /J-galactosidase activities 
(Muller-Hill and Kania, 1974), and it is made 
in relatively large amounts because its pro- 
moter is the mutant form / Q (Muter-Hill, Crapo 
and Gilbert, 1968). The dashed Pvu II frag- 
ment is described in the legend to Figure 7a. 



Sam HI linker was inserted to yield pLG300 (Figure 7b), This process 
destroys the Hind 111 site. Similarly, pLG300 was cut with Bam HI, the 
ends were filled in and a Hind III linker was inserted to give pLG400 
(Figure 7c). By cutting PLG200, pLG300 and PLG400 with the 
appropriate restriction enzymes and filling in the cohesive ends, lacZ 



can be fused in frame to any given amino terminal gene fragment 
having Hush ends. 

To verify that the above manipulations created the predicted DNA 
sequence, we constructed a fusion of the lacZ gene fragment of 
PLG400 to a Hae III fragment of the cl gene of phage A that encodes 
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I Partially digest pU35 with Pvu II 
2. Insert Hind M linker 




'HindM- 

[CC^6£ZTGG]CT(v 
HindM lac I 



! Cut with HindM 

2 Fili in with DNA polymerase 

3. Insert Barn linker 




CCAAGCT[CC^4ZSTGG] AGCTTGGCTG 
Bam lacl 

I 



!. Cut with Bam 

2 Fill in with DNA polymerase 

3 Insert HindM linker 




CCAAGCTCCGGATC[CC^gCr7-G g]g/l7rtr GGAGCTTGGCTG 
HindM {Bam) "loeiT 

Figure 7. Construction of lacZ Fusion Plasmids (Continued) 
(a) Introduction of linkers. pLG5 was partially digested with Pvu H and 
religated in the presence of Hind III linkers. Following religation and 
transformation, a plasmid bearing a Hind III linker in place of the Pvu 
II fragment indicated in Figure 6 by a dashed line was recognized as 
follows. The ligated DNA was introduced into LG75 (F~, /ac-Zam rn5a6 ). 
In this strain, the presence of the fee repressor supplied by the 
plasmid results in a Mellbiose" phenotype because it represses the 



all but the carboxy terminal six amino acids of A repressor <Sauer. 
1978) {Figure 8). The resulting plasmid, pLG490. conferred upon 
host clones immunity to phage X and also directed the synthesis of 
jS-galactosidase. These activities apparently reside on one hybrid 
protein with the X repressor fragment at the amino terminus and fi- 
galactosidase at the carboxy terminus. Clones bearing pLG490 pro- 
duce a large protein of approximately the expected size (see Figure 
9). 




HoeM 



1. Excise indicated HoeM fragment from pKB252 

2. Cut pLG400 with HindM 

3. Fill in HindM ends with DNA polymerase 

4. Insert HoeM fragment 




Figure 8. A cl-lacZ Fused Gene 

A Hae III fragment containing the first 231 (of 237) codons of wild- 
type eJ (Sauer, 1978) was isolated from pKB252 (Backman et al., 
1 976). This fragment also bears two lac promoters that direct tran- 
scription of the cl gene. The fragment was ligated with pLG400 DNA 
that had been cut with Bam HI and its ends rendered flush with DNA 
polymerase, and the ligation mix was used to transform LG100 (F~, 
A/ac) to Amp" on plates containing XG. Blue colonies appeared at a 
frequency of about 10%, and these colonies also acquired immunity 
to phage A. Plasmid DNA from one such clone had the anticipated 
structure. 



chromosomal lacY gene. Removal of the Pvu II fragment was expected 
to render the plasmid incapable of directing the synthesis of repressor 
or /J-galactosidase activities because the tacl° promoter and 5' end 
of tact would be missing. Still present, however, would be a region 
homologous to the portion of facZ bearing the amber mutation on the 
chromosome. Amp" clones were therefore screened for the following 
three phenotypes: LacZ" (on XG); Lacr (on Melibiose MacConkey 
indicator agar at 42°C); homology with fhe lacZ amber mutation (by 
recombination to LacZ + on lactose MacConkey indicator agar). DNA 
was prepared from candidate clones and one. pLG200, had the 
correct structure. pLG300 and pLG40Q were constructed as de- 
scribed in the text. 
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Figure 9. A X Repressor-;8-Galactosidase Hybrid Protein 
Cells from saturated cultures were tieated to 90°C for 3 min in sample 
buffer (Laemmli, 1970), and the extracts were electrophoreses in 
7.5% acrylamide gels (Laemmli, 1970>. The positions of ^-galacto- 
sidase and the putative hybrid protein are indicated. Note that strain 
LG101 is not isogenic to LG100. (1) LG100 (F~, A tec); (2) LG101 
(F". Mac/flacl', lacZ*)\ (3) LG100/PLG490. 
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FIELD OF THE INVENTION 

This invention relates to the microbial production, via recombinant 
DNA technology, of huiaan fibroblast interferon for use in the treatment 
of viral and neoplastic diseases, and to the means and end products of 
such production. ..... 
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BACKGROUND OF THE INVENTION • 

The publications and other materials referred to herein to illuminate 
the background of the invention and, in particular cases, to provide 
additional detail respecting its practice are incorporated herein by 
reference and, for convenience; are numerically referenced in the 
following text and respectively grouped in the appended bibliography. 

Recombinant ONA Technology 

Kith the advent of recombinant DNA technology, the controlled 
microbial production of an enormous variety of .useful polypeptides has 
become possible. Already in hand are bacteria modified by this 
technology to permit the production of such polypeptide products such as 
somatostatin, the (component) A and B chains of human insulin, human 
growth hormone. More recently, recombinant DNA techniques have been used 
to occasion the bacterial production of proinsulin, thymosin alpha 1, (an 
immune potentiating substance produced by the thymus) and leukocyte 
interferon. 

The workhorse of recombinant DNA technology is the plasraid, a 
non-chromosomal loop of double-stranded DNA found in bacteria and other 
microbes, oftentimes 1n multiple copies per cell. Included in the 
Information encoded in the plasmfd DNA is that required to reproduce the 
plasmid in daughter cells {i.e., a "replicon") and ordinarily, one or 
more selection characteristics such as, in the' case of bacteria, 
resistance to antibiotics which permit clones of the host cell containing 
the plasmid of interest to be recognized and preferentially grown In 
selective media. The utility of plasrafds lies' in the fact that they can 
be specifically cleaved by one or another restriction endonuclease or 
•restriction enzyme", each of which recognizes a different site on the 
plasmidic DNA. Thereafter heterologous genes or gene fragments may 6e 
inserted into the plasmfd by endwise joining at the cleavage site or at 
reconstructed ends adjacent to the cleavage site. DNA recombination is 
performed outside the cell, but the resulting "recombinant" plasraid can 
be introduced into it by a process known as transformation and large, 
quantities of the heterologous gene-containing recombinant plasmid 
obtained by growing the. transforraaht. Moreover, where the gene is 
properly inserted with reference to portions of the plasmid which govern 
the transcription and translation of the encoded DNA message, the 
resulting expression vehicle can be used to actually produce the 
polypeptide sequence for which the inserted gene codes, a process 
referred to as expression. 
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Expression is initiated in a region known as the promoter which is 
i recognized by and bound by RNA polymerase. In some cases, as in the 

tryptophan or "trp" promoter preferred in the practice of the present 
invention, promoter regions are overlapped by "operator" regions to fora 
a combined promoter-operator. Operators are DMA" sequences which are 
recognized by so-called repressor proteins which serve to regulate the 
frequency of transcription initiation at a particular promoter. The 
polymerase travels along the ONA, transcribing the information contained 
In the coding strand from its 5* to 3* end into messenger RIIA which is in 
turn translated into a polypeptide having the amino acid sequence for 
which the DMA codes. Each amino acid is encoded by a nucleotide triplet 
or "codon* within what may for present purposes. be referred to as the 
"structural gene", i.e. that part which encodes the amino acid sequence 
: of the expressed product. After binding to the promoter, the RNA 

polymerase first transcribes nucleotides encoding a Hbosonie binding 
Site, then a translation initiation or "start" signal (ordinarily ATG, 
which in the resulting messenger RNA becomes AUG), then the nucleotide 
codons within the structural gene itself. So-called stop codons are 
transcribed at the end of the structural gene whereafter the polymerase 
may fora an additional sequence of messenger RNA which, because of the 
presence of the stop signal, will remain untranslated by the ribosomes. 
Ribosomes bind to the binding site provided on the messenger RNA, in 
bacteria ordinarily as the mRNA is being formed, and themselves produce 
the encoded polypeptide, beginning at the translation start signal and 
ending at the previously mentioned stop signal. The desired product is 
produced if the sequences encoding the ribosome binding site are 
•positioned properly with respect to the AUG initiator codon and if all 
remaining codons follow the initiator codon in phase. The resulting 
product may be obtained by lysing the host cell and recovering the 
product by appropriate purification from other, bacterial protein. 

Fibroblast Interferon " . ' 

Human fibroblast interferon (FIF) is an -antiviral protein which also 
exhibits a wide range of other biological activities (see ref. 1 for " 
review). It has reportedly been purified to homogeneity as a single - 
polypeptide of 19,000-20,000 molecular weight having a specific activity 
of 2 to 10 X 10 8 units/mg (2,3). The sequence of the 13 NH 2 -terrainal 
amino acids of FIF has been determined (4). Houghton et al. (5) have 
used synthetic deoxyoligonucleotides (predicted from this amino acid 
sequence) to determine the sequence of the 276 5 '-terminal nucleotides of 
FIF mRNA. Taniguchi et al_. (6) and Derynck et a£. (7) have recently 
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employed RNA selection procedures to identify cloned cDNA copies of FIF 
mRMA in E. col i . See also Tantguchi et al . , Gene 10, 11 (1980) and Proc. 
Hatl. Acad. Sci. (U.S.A.)' 77. 5230 (I960) and Mature 285 , 547 (1980). 

While isolation from donor fibroblasts has provided sufficient 
material for partial characterization and limited clinical studies with 
homogeneous fibroblast interferon, it is a totally inadequate source for 
the amounts of interferon heeded for large scale clinical trials and for 
broad scale prophylactic and/or therapeutic use thereafter. Indeed, 
presently clinical investigations employing human fibroblast-derived 
interferon in antitumor and antiviral testing have principally been 
confined to crude (< 1 percent pure) preparations of the material, and 
long lead times for the manufacture of sufficient quantities, even at 
unrealistic price levels, have critically delayed investigation on. an 
expanded front. . - 

We perceived that application of recombinant ONA technology would be 
the most effective way of providing large quantities of fibroblast 
interferon which, despite the absence in material so produced of the 
glycosylatipn characteristic of human-derived, material, could be employed 
clinically in the treatment of a wide range of viral and neoplastic 
diseases. .. 

Hore particularly, we proposed and have since succeeded fn producing 
mature human fibroblast interferon microbi.ally, by constructing a gene 
therefor. which could then be inserted in microbial expression vehicles • 
and expressed under the control of microbial gene regulatory controls. 

Our approach to obtaining a fibroblast gene involved the following 
tasks: 

1. Partial amino acid sequences would be obtained by • 
characterization of fibroblast interferon purified to essential 
homogeneity, and sets of synthetic DNA probes constructed whose codons 
would, in the aggregate, represent all the possible combinations capable 
of encoding the partial amino acid sequences. 

2. Bacterial colony banks would be prepared containing cONA from 
induced messenger RNA. The probes of part (1) would be used to prime the 
synthesis of radio-labelled single stranded cDNA for use as hybridization 
probes. The synthetic probes would hybridize with induced mRNA as 
template and be extended by reverse transcription to form induced, 
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radio- labelled cDHA. Clones from the colony bank that hybridized to 
radio-labelled cDMA obtained in this manner would be investigated further 
to confirm the presence of a full-length interferon encoding gene. Any 
partial length putative gene fragment obtained would itself be used as a 
probe for the full-length gene. 

-3. The full-length gene obtained above would be tailored, using 
synthetic DNA, to eliminate any leader sequence that might prevent 
microbial expression of the nature polypeptide and to permit appropriate 
positioning in an expression vehicle relative to start signals and the ' 
ribosome binding site of a microbial promoter. Expressed interferon 
would be purified to a point permitting confirmation of its character and" 
determination of its activity notwithstanding the absence of 
glycosylation. ' . 



SWWRy OF IMVEHTIOH 

A series of replicable plasmidic expression vehicles have been 
constructed which direct the high level synthesis in transformant 
microorganisms of a mature polypeptide with the properties of authentic 
human fibroblast interferon. The product polypeptide exhibits the amino 
acid sequence of such interferon and is active In in vitro testing 
despite the lack of glycosylation characteristic of the human-derived 
material. Reference herein to the expression of "mature fibroblast 
interferon," connotes the bacterial or other microbial production of an -' 
interferon molecule unaccompanied by associated glycosylation and the ^i— 
presequence that immediately attends tnRNA translation of the human 
fibroblast interferon genome. Mature fibroblast interferon, according to-*^ 
the present invention, fs immediately expressed from a translation start 
signal (ATG) which also encodes the first amino acid codon cf the hatural > a^' 
product. The presence or absence of the methionine first amino acid in 
the microbially expressed product is governed by a kinetic' phenomenon 
dependent on fermentation growth conditions and/or levels of expression 
in the transformant host. Mature fibroblast interferon could be expressed 
together with a conjugated protein other than the conventional leader, 
the conjugate being specifically cleavable in an intra- or extracellular 
environment. See British Patent Publication Ho. 2007676A. Finally, the 
mature interferon could be produced in conjunction with a microbial 
"signal" peptide which transports the conjugate to the cell wall, where 
the signal is processed away and the mature polypeptide secreted. 



00831 -5- 



Board Assigned Page #2140 



• 1 I 

Figures 1 to 5 appended hereto are described in the detailed text 
infra . Figure 6 schematically depicts the construction of plasmids 
coding for the direct expression of mature fibroblast interferon. 

D 

Restriction sites and residues are as shown ("Pst 1", etc.). "Ap " and 
"Tc R " connote portions of the plasmid(s) which express, respectively, 
ampicillin and tetracycline resistance. The legend "p o" is an 
abbreviation for "promoter operator." 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

A. Microorganisms Employed 

The work described Involved use of the microorganism: E. coli K-12 
strain 294 {end A, thi-, hsr-, hsn£), as described in British Patent 
Publication No. 2055382 A./ l TM$-s-tfa in has been dept)! ,! led W l III Uie 
ito^a«-4ype-- Cu 1 t u re - Collation; ATCC Accessi o n H o . 3144G; All 
recombinant DNA work was performed in compliance with applicable 
guidelines of the National Institutes of Health. 

The invention, in its most preferred embodiments, is described with 
reference to E. coli , including not only strain E. coll K-12 strain 294, 
defined above, but also other known IE. coli strains such as E. coll B, 
E. coli x 1776 and I. coli W 3110, or other microbial strains many of 
which are deposited and (potentially) available from recognized 
microorganism depository institutions, such as the American Type Culture 
Collection (ATCC)— cf. the ATCC catalogue listing. See also German 
Offenlegungsschrift 2644432. These other microorganisms include, for 
example, Bacilli . such as Bacillus subtilts and other enterobacteriaceae 
among which can be mentioned as examples Salmonella typhimurium and 
Serratia marcesans, utilizing plasmids that can replicate and express 
heterologous gene sequences therein. Yeast, such as Saccharomyces \ 
cerevisiae, may also be employed to advantage as host organism in the 
preparation of the interferon proteins hereof by expression of genes 
coding therefor under the control of a yeast promoter. {See the 
copending U.S. patent application of Hitzeman et a]_,, filed February 25, 
1981 (Attorney Docket No. 100/43), assignee Genentech, Inc. et ah, which 
is incorporated herein by reference.) 

MATERIALS ADD METHODS 

General methods. 

Restriction enzymes were purchased from New England Biolabs and used 
0083L ' -6- . 
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as directed. Plasraid D.VA was prepared by'a standard cleared lysate 
procedure {8} and purified by column chromatography on Biogel A-50M 
(Dio-Rad). DMA sequencing was performed using the method of Haxam and 
Gilbert (9). DNA restriction fragments were isolated from polyacryl amide 
gels by electrocution. DNA fragments were radiolabeled for use as 
hybridization probes by the random calf thymus DHA priming procedure of 
Taylor et aK (10). In situ colony hybridizations were performed" by the 
Grunstein-Hogness procedure (11). 

Chemical synthesis of deoxyoligonucleotides. 

The deoxyoligonucleotides were synthesized by the modified 
phosphotriester method in solution (12), using trideoxynucleotldes as 
building blocks (13). The materials and general procedures were similar 
to those described (14). The six pools of primers (Fib 1-6} containing 
four dodecanucleotides each were obtained by separately coupling two 
hexamer pools (of two different S^terminal sequences each) with three 
different hexamer pools (of two different 3»-terminal sequences each). 

Induction of fibroblasts. 

Human fibroblasts (cell line GM-2504A) were grown as described 
previously (15). Growth medium. (Eagle's minimal essential medium 
containing 10 percent fetal calf serum) was removed from roller bottles 
(Corning, 850 cm 2 ) and replaced with 50 ml growth medium containing 50 
ng/nil of poly(I):poIy(C) (PL Biochemical s) and 10 ng/ral cyclohexiraide. ■• 
This induction medium was removed after 4 hours at 37*C and cell 
monolayers were washed with PBS (0.14M MaCl. 3mM KC1, 1.5 mH KH 2 P0 4 , 
8mM Na 2 HP0 4 ). Each bottle was incubated at 37*C with 10 ml of a 
trypsin - EDTA solution (Gibco 610-5305) until" cells were detached, and 
fetal calf serum was added to a concentration of 10 percent^ Cells were 
spun for 15 minutes at 500 x g and pellets were resuspended in PBS, v 
pooled, and resedimented. Cells were frozen in liquid nitrogen. 
Approximately 0.17g of cells were obtained per roller bottle. 

Preparation and assay of interferon mP.MA. 

PolyfAj-containing nRHA was prepared from human fibroblasts by phenol 
extraction and oligo(dT)- cellulose chromatography as described elsewhere 
(16). The poly (A) containing RMA was enriched for interferon mRHA by 
centrifugation on a linear 5 percent to 20 percent (w/v) sucrose 
gradient. The RHA samples were headed to 80*C for 2 minutes, rapidly 
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cooledj layered over the gradient, and centrifuged for 20 hours at 30,000 
rpra at 4*C in a Beckman SW-40 rotor. Fractions were collected, ethanol 
precipitated, and dissolved in H 2 0. 

One microgram samples of mRNA were injected into Xenopus laeyis 
oocytes as described previously (17,18). The injected oocytes were 
incubated 24 hours at 2i*C, homogenized, and centrifuged for 5 minutes at 
10,000 x g. The interferon* in the supernatant was determined by the 
cytopathtc effect (CPE) inhibition assay {1) using Sindbis virus and 
human diploid (WISH) cells. Interferon titers of 1,000 to 6,000 units 
recovered (NIH reference standard) per microgram of RNA injected were 
routinely obtained for the 12S species of mRNA.. 

Synthesis and cloning of cDNA. 

Single stranded cDNA was prepared in 100 jil reactions containing 5 »g 
of 12S fraction mRNA, 20 mM Tris-HCl (pH 8.3), 20 mM KC1 , 8raM MgCl 2 , 30 
vH B-mercaptoethanol, 100 uCi of (a 32 P)dCTP (Amersham) and lmM dATP, 
dCTP, dGTP, dTTP. The primer was the synthetic Hind HI decamer 
dCCAAGCTTGG (19), which had been extended at the 3' terminus with about 
20 to 30 deoxythymidine residues using terminal deoxynucleotfdyl 
transferase (20). 100 units of AMY reverse transcriptase were added and 
the reaction mixture was incubated at 42*C for 30 minutes. The second 
strand DNA synthesis was carried out as described previously (21). The 
double stranded cDNA was treated with 1200 units of SI nuclease (Miles 
Laboratories) for 2 hours at 37*C in 25 mM sodium acetate {pH 4.5), ImM - 
ZnCljj, 0.3M NaCl. After phenol extraction the mixture was separaied 
electrophoretically on an 8 percent polyacryl amide gel. cDNA (" 0.5"ng) 
ranging from 550 to 1500 base pairs in size was recovered by electro- 
elution. A 20 ng aliquot was extended with deoxyC residues using . 
terminal deoxynucleotidyl transferase (20), and annealed wtft LOO ngVof 
pBR322 which had been cleaved with' Pst I and tailed with deoxyG residues 
(20). The annealed mixture was used to transform E. coli K-12 strain 294 
(22) by a published procedure (23).. Strain 294 was used throughout in 
the work described here, and has been deposited with the American. Type 
Culture collection, accession no. 31446. 

Preparation of induced and uninduced 32 P-cDNA probes. 5 v g of 12S 
mRNA were combined with either 2 (ig of oligo (°7)i2_i8 (Collaborative 
Research) or 5 m3 of each synthetic primer pool (Fib 1 to Fib 6) in 60 |il 
of lanH Tris-HCl (pH 8), 1 mM EDTA. The mixtures were boiled 3 minutes, 
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and quenched on ice. 60 M 1 of 40 mM Tris-HCl (pll 8.3), 40 mM KC1, 16mM 
MgCl 2 , 60 uiH B-mercaptoethaitol, 1 mM dATP, dGTP, dTTP and 5 x 10' 7 H {«- 32 P 
dCTP (Araersham, 2,000 - 3,000 Ci/romole) was added to each tempi ate-pn'mer 
mix at 0*C. After the addition of 100 units of AW reverse transcriptase, 
the reactions were incubated at 42*C for 30 minutes and purified by 
passage over 10 ml Sephadex G-50-colunns. The' products were treated with 
0.3N HaOH for 30 minutes at 70*C, neutralized, and ethanol precipitated. 

32 * 
The P-cDHAs were confined with lOOpg of poly(A) mRNA from 

uninduced fibroblasts in 50 ?! of 0.4M sodium phosphate (pH6.8), 0.1 

percent SOS. The mixtures were heated at 98*C for S minutes and allowed 

to anneal 15 hours at 45*C. The DNA-RHA hybrids, (containing uninduced 

cOHA sequences) were separated from single-stranded DHA (induced cOHA 

sequences) by chromatography on hydroxyapatite as described by Galau 

et al_- (24). The DNA-RNA hybrids were treated with alkali to remove RNA. 

Screening of recombinant plasmids with 32p. c pHA probes. 

Approximately 1 pg samples of plasraid DNA were prepared from 
individual transforraants by a published procedure (25). The DMA samples 
were linearized by digestion with Eco RI, denatured in alkali, and 
applied to each of three nitrocellulose filters (Schleicher and Schuell, 
BA85) by the dot hybridization procedure (26). The filters were 
hybridized with the 3Z P-cOMA probes for 16 hours at 42*C In 50 percent 
foraamide, lOx Oenhardt's solution (27), 6xSSC r 40 mM Tris-HCl (pH 7.5), 
2mH EDTA, 40 ug/ml yeast RNA. Kilters were washed with O.lxSSC, 0.1 ;. 
percent SOS twice for 30* at 42*C, dried, and exposed to Kodak XR-2 x-ray 
film using Dupont Lightning-Plus intensifying screens at -80*C. 

Construction of plasmids for direct expression of FIF. 

The synthetic primers I (dATGAGCTACAAC) and II (dCATGAGCTACAAC) were 
phosphorylated using T4 polynucleotide kinase and ( T - 3 ?P)ATP (Aniershami 
to a specific activity of"700 Ct/mmole as described previously (28). 
Primer repair reactions were performed as follows: 250 pmoles of the ■ 
32 P-primers were combined with 8 t>9 (10 pmole) of a 1200 bp Hha I 
restriction fragment containing the FIF cDMA sequence. The mixture was 
ethanol precipitated, resuspended in 50 |il h^O, boiled 3 minutes, 
quenched in a dry ice-ethanol bath, and combined with a 50 |il solution of 
20nM Tris-HCl (pH 7;5),' 14 utt MgClj. 120 mM Had, 0.5 mM dATP,- dCTP, 
dGTP, dTTP at 0*C. 10 units of DHA polymerase 1 Klenow fragment • 
(Boehringer-Mannheira) were added and the mixture was incubated at 37'C 
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for 4 }/2 hours. Following extraction with phcnol/CHCIj and 
restriction with £st I, the desired product was purified on a 6 percent 
polyatryl amide gel. Subsequent ligations were done at room temperature 
(cohesive termini J or 4*C {blunt ends! using previously detailed 
conditions (21,28). 

Assay for interferon expression in E. coli. 

Bacterial extracts were prepared for IF assay as follows: One ml 
cultures were grown overnight in LB {29} containing 5 ng/ml tetracycline, 
then diluted into 25 ml of H9 medium {29} containing 0.2 percent glucose, 
0.5 percent casamino acids and 5 nS/ml tetracycline. 10 ml samples were 
harvested by centrifugation when Aggg (Absorbance at 550 nanometers} 
reached 1.0. The cell pellets were quickly frozen in a dry ice-ethanol 
bath a.nd cleared lysates were prepared as described by Clewell (8). 
Interferon activity in the supernatants was determined by comparison with 
HlH FIF standards using cytopathic effect (CPE) inhibition assays as 
reviewed previously (1). Two different assays were. used: (a). WISH 
(human amnion) cells were seeded in microtiter dishes. Samples were 
' added 16 to 20 hours later and diluted by serial 2-fold dilution. 
Sindbis virus was added after at least 3 hours of incubation. Plates 
were stained 20 to 24 hours later with crystal violet, (b) HOBK (bovine 
kidney) cell lino was seeded simultaneously with 2-fold dilutions of 
samples. Vesicular stomatitis virus was added after 2 to 3 hours 
incubation and plates were stained with crystal violet 16 to' 18 hours 
later. To test pH 2 stability bacterial extracts and standards were 
diluted in minimal essential medium to a concentration of 1000 units/ml. 
One ml aliquots were adjusted to pH 2 with IN HC1, incubated at 4*C for 
16 hours, and neutralized by addition of RaOH. IF activity was 
determined by the CPE inhibition assay using human amnion cells. To 
establish antigenic identity 25 ul aliquots of the 1000 U/mT- interferon 
samples (untreated) were incubated with 25 ul of rabbit antihuman 
leukocyte interferon for 60' at 37*C, centrifuged at 12,000 x g for 5 
minutes and the supernatant assayed. Fibroblast and leukocyte Interferon 
standards were obtained from the National Institutes of Health. Rabbit 
antihuman leukocyte interferon was obtained from the National Institute 
of Allergy and Infectious Diseases. 

RESULTS . • 

Chemical synthesis of primer pools complementary to FIF mRNA. 

The amino-terainal protein sequence of human fibroblast interferon 
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(4) permitted us to deduce the 24 possible mRMA sequences which could 
code for the first four amino acids. The 24 complementary deoxyoligo- 
nucleotides were synthesized in 6 pools of 4 dodecamers each (Figure 1). 

The six pools of 4 deoxyoligonucleotides each were synthesized by a 
modified phosphotriester method- that has been used previously for the 
rapid synthesis of oligonucleotides in solution (12) and on solid -phase 
(14). The basic strategy involved- reacting two different 3'-blocked 
triners with an excess of a single 5'-protected trimer to yield a pool of 
two hexamers, each represented equally. The coupling of two pools, each 
containing two hexamers, then resulted in a pool of four dodecamers. 

Identification of FIF cDHA clones . 

Using IZS tnRNA from induced human fibroblasts (1,000 units IF 
activity per ug in oocyte assay), -double stranded cDHA was prepared and 
inserted into pBR322 at the Pst I site by the standard dG:dC tailing 
method (20). A fibroblast cDNA library, consisting of 30,000 arapicillin- 
sensitive, tetracycline-resistant transformants of JE. coli K-12 strain. 
294 was obtained from 20 ng of cDNA ranging in size from 550 to 1300 base 
pairs. Plasraid DNA was prepared from 600 of the transformants and applied 
to 3 sets of nitrocellulose filters as described in Materials and Methods. 

The approach followed in the identification of hybrid plasmids 
containing fibroblast interferon cDNA sequences was similar to that used 
to identify human leukocyte interferon recombinant plasmids (30). 
Radiolabeled cDNA' hybridization probes were prepared using either the 24 
synthetic dodecamers or oligofdTjjg.jg as primers and-12S RNA from 
induced fibroblasts (5000 units/ug in oocytes) as template. The 32 P- 

8 

cDHAs (specific activity >5 x 10 cpsi/tig) obtained were hybridized to a 
large excess of mRMA isolated from uninduced human fibroblasts, and. the 
aRHA-cDNA hybrids were separated from unreacted cDHA by hydroxyapattte 
chromatography (24). The single stranded cONA fractions should be 
enriched for sequences which are present in induced fibroblasts but 
absent in uninduced cells, and the mRNA-cDMA hybrids should represent 
sequences common to both induced and uninduced cells. Approximately . 
4 x 10^ cpm of single stranded cOMA (hybridization probe A) and 8 x 
10** cpm of cDtlA-mRNA hybrids were obtained using oligo(dT)j 2 _ 18 
primed cDHA; 1.5 x 10 6 cpm of single stranded (hybridization probe B) 
and 1.5 x 10** cpm of hybrids were obtained from cOkA primed using 
synthetic dodecamer pools Fib 1-6. The cDNA-raRNA hybrids from both 
fractionations were combined, the RNA hydrolyzed by treatment with 
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alkali, and the P-cDMA used as hybridization probe C. Many of the 
600 plasmid samples hybridized with both probes A and C, indicating that 
the hybridization reactions between uninduced raRNA and J P-cDMA (prior 
to the hydroxyapatite fractionation step) had not gone to completion. 
However, only one of the 600 plasmids (pF526) hybridized strongly with 
the specifically primed, induced cDNA probe B (Figure 2). Plasmid pF526 
also..hybridized with the total oligo{dT)j 2 _ 18 primed, induced cDMA- 
probe A, and failed to give, detectable hybridization to the combined 
uninduced probe C. 

Pst I digestion of pF526 showed the cloned cDHA insert to be about 
550 base pairs long, probably too short to contain the entire coding 
region for a protein the size of fibroblast interferon- Therefore, a 
P-labeled DNA probe was prepared from this Ps£ I fragment by random 
priming with calf thymus DNA (10). This probe was used to screen 2000 
individual colonies from a newly constructed fibroblast cDNA library (the 
new cDNA library was prepared using 12S mRNA from induced fibroblasts 
having a titer of 6,000 units/ml .in the oocyte assay system). Sixteen 
clones hybridized to the probe. Plasmids prepared from the majority of 
these released two fragments when cleaved with Pst I, indicating that the 
cDNA contained an internal Pst I site. Clone pFIF3 contained the largest 
cDHA insert, about 800 base pairs. The DNA sequence of the insert was 
determined by the Haxam-Gilbert procedure (9) and is shown in Figure 3. 
The amino acid sequence of human fibroblast interferon predicted from the 
nucleotide sequence is identical to that' repotted recently by Taniguchi 
et a]_. (31) and by Derynck et al_. (7) from DBA sequencing of FIF cDNA 
clones. A precursor or signal peptide of 21 amino acids is followed by a 
mature interferon polypeptide of 166 amino acids, a stretch of 196 
3* -untranslated nucleotides and a poly.(A) tail. The NH^- terminal 20 
amino acids of mature FIF have been directly determined by protein 

microsequencing and are the same as those predicted from the, DMA sequence. 

. V 

Direct expression of fibroblast interferon. 

To express high levels of mature fibroblast interferon in E. coli , 
initiation of protein synthesis must occur at the ATG codon of the mature 
polypeptide (amino acid 1) rather than at the ATG of the signal peptide 
(amino acid SI) (Figure 3). 

Our approach to removing the signal peptide coding regions from pF!F3 
is depicted in Figure 4. A 1200 bp DNA fragment which contained the 
entire cDNA insert was isolated from a polyacry-1 amide gel after digesting 
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pFIF3 with Hha I. Two separate synthetic deoxyoligonucleotide primers, 

dATGAGCTACAAC { I ) and dC ATGAGCTACAAC (II), were prepared. Both primers 

contain the coding sequence for the first four amino acids of mature 

fibroblast interferon; primer II has an additional C at the 5 '-terminus. 

Primer repair reactions and subsequent ligations were carried out 

separately for primers I and II, and gave nearly identical results. 

Therefore, only reactions using primer I are discussed fn detail here. 

3? 

The primers were 5'-radiolabeled using P)ATP and T4 polynucleotide 
kinase, combined with the 1200 bp Hha I DNA fragment and the mixture 
denatured by boiling. Following hybridization of the primer to the 
denatured Hha_ I DNA fragment, E. coli DNA polymerase I Kl enow fragment 
{33} was used to catalyze the' repair synthesis of the plus (top) strand 
(Figure 4). In addition, the associated, 3*>5' exomiclease activity of 
the Klenow fragment removed the 3'-protruding end from the minus (bottom) 
strand., leaving a flush end. Analysis of samples of the reaction mixture 
by polyacryl amide gel electrophoresis indicated that the repair synthesis 
did not go to completion, but. stopped at several discrete sites. 
Therefore, the entire reaction mixture was treated with Pst I and the 
desired 141 bp fragment (180,000 Cerenkov cpm; "0.3 pmole) was purified, 
by polyacryl amide gel electrophoresis (Figure 5). Ligation of this 
fragment to 1 \>g ("4 pmole) of the 363 bp Pst I -Bgl II fragment Isolated 
from.pFIF3 (Fig. 4), followed by BoJ II digestion, yielded 50,000 
Cerenkov cpm ("0.1 pmole, "30 ng) of the 504 bp DNA fragment containing 
the entire coding sequence for mature fibroblast interferon. The same 
reactions using primer II gave 83,000 cpm ("0.15 pmole, "50 ng) of 505 bp 
product* ... 

The construction of plasmids which direct the synthesis of human 
fibroblast interferon is outlined In Figure 6. Separate- expression 
plasmids were constructed which placed FIF synthesis under the control of 
the E_. coli 1 ac or trp promoter-operator systems. Both of these systems 
have proven useful for the direct expression of eukaryptic genes fn \ 
E. coli : human growth hormone has been efficiently synthesized using the 
lac system (21) and human leukocyte interferon has been produced at high 
levels using the trp system (30) and Mature 287, 411 (1980). 

pBRH trp was digested with EcoRI restriction enzyme and the resulting 
fragment isolated by PAGE and electroelution. EcoRI-digested plasmid 
pSom 11 (K. Itakura et al_.. Science 198 , 1055 (1977); G.B. patent 
publication no. 2 007 676 A) was combined with the above fragment. The 
mixture was ligated with T 4 OHA ligase as previously described and the 
resulting DNA transformed into E; coll K-12 strain 294 as previously 



00831 -13- 



Board Assigned Page #2148 



described. Trans formant bacteria were selected on ampicill in-containing 
plates. Resulting ampicillin-resistant colonies were screened by colony 
hybridization (H. Gruenstein et aU, Proc Nat'l Acad Sci USA 72, 
3951-3965 [1975]) using as a probe the trp promoter-operator-containin'g 
the above fragment isolated from pBRHtrp, which had been radioactively 
labelled with p^„ Several colonies shown positive by colony 
hybridi2aticn were selected, plasmid DNA was isolated and the orientation 
of the inserted fragments determined by restriction analysis employing 
restriction enzymes Bglll and BamHl in double- digestion. E. coli 294 
containing the plasraid designated pS0M7a2, which has the trp 
promoter-operator fragment in the desired orientation was grown in LB 
medium containing 10 ng/rol arapicillin. The cells were grown to optical 
density 1 (at 550 nH), collected by centrifugation and resuspended in m 
media in tenfold 'dilution. Cells were grown for 2-3 hours, again to 
optical density 1, then lysed and total cellular protein analyzed by SDS 
(sodium dodecyl sulfate) urea (15 percent) polyacryl amide gel 
electrophoresis (J.V. Haizel Jr. et aU, Meth Viral 5, 180-246 [3971]). 

Plasraid pBR322 was Hind III digested and the protruding Hind III ends 
in turn digested with SI nuclease. The SI nuclease digestion involved 
treatment of 10 yg of Hind Ill-cleaved pBR322 in 30 til SI buffer (0.3 M 
NaCl, 1 mM ZnCl 2 , 25 mM sodium acetate, pH 4.5) with 300 units SI 
nuclease for 30 minutes at I5*C. The reaction was stopped by tlie 
addition of 1 fl of 30 X SI nuclease stop solution (0.8M tris base, 50 mM 
EDTA). The mixture was phenol extracted, chloroform extracted and 
ethanol precipitated, then EcoRI digested as previously described and the 
large fragment UJ. obtained by PAGE procedure followed by 
electroelutfon. The fragment obtained has a first EcoRI sticky end and a 
second,' blunt end whose coding strand begins with the nucleotide 
thymidine. 

Plasmid pSoi»7 42, as prepared above, was Bgl II digested and the^Bgl 
II sticky ends resulting. made double stranded with the Klenow polymerase 
I procedure using all four deoxynucleotide triphosphates. EcoRI cleavage 
of the resulting product followed by PAGE and electroelution of the small 
fragment J2) yielded a linear piece of DNA containing the tryptophan 
promoter-operator and codons of the IE' "proximal" sequence upstream from 
the Bgl II site ("LE'lpl"). The product had an EcoRI end and a blunt end 
resulting from filling in the Bgl II site. However, the Bgl II site is 
reconstituted by ligation of the blunt end of the above fragment (2) to 
the blunt end of the above prepared fragment (1) . Thus, the two 
•fragments were- liga ted in the presence of T. DNA ligase to form the 
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recirculated plasraid pHKY 10 which was propagated by transformation into 
competent E. coli strain 294 cells. 



Plasmfd pGHl carries the E . coli tryptophan operon containing the 
deletion alE1413 (G.F. Miozzari, et al.. (1978)' J. Bacteriology 133, 
1457-1466)} and hence expresses' a fusion protein comprising the first 6 
amino acids of the trp leader and approximately the last third of -the trp 
E polypeptide (hereinafter ^referred to in conjunction as IE'), as well as 
the trp D polypeptide in its entirety, all under the control of the trp 
promoter-operator system. The plasraid, 20 ng, was digested with the 
restriction enzyme PvuII which cleaves the plasraid at five sites. The 
gene fragments were next combined with EcoRI linkers {consisting of a 
self complementary oligonucleotide of the sequence: pCATGMTTCATG J 
providing an EcoRI cleavage site for a later cloning into a plasmid 
containing an EcoRI site. The 20 pg of DNA fragments obtained from pGMl 
were treated with 10 units T 4 DNA ligase in the presence of 200 pfco 
moles of the 5'-phosphorylated synthetic oligonucleotide pCATGMTTCATG 
and in 20 w l T 4 DNA ligase buffer {2CmM tris, pH 7.6,. 0.5 mM ATP, 10 mM 
HgCl 2 , 5 nfi dithiothreitol) at 4*C overnight. The solution was then 
heated 10 minutes at 70*C to halt ligation. The linkers were cleaved by 
EcoRI digestion and the fragments, now with EcoRI ends were separated 
using 5 percent polyacryl amide gel electrophoresis (hereinafter "PAGE") 
and the three largest fragments .isolated from the gel by first staining 
with ethidi urn bromide, locating the fragments with ultraviolet light, and 
cutting from the gel the portions of interest. Each gel fragment, with 
300 microliters O.lxTBE, was placed in a dialysis bag and subjected to 
electrophoresis at 100 v for one hour in O.lxTBE buffer (TBE buffer 
contains: 10.8 gm tris base, 5.5 gm boric acid, 0.09 gm Na 2 EDTA in 1 
liter H 2 0). The aqueous solution was collected from the dialysis bag, 
phenol extracted, chloroform extracted and made 0.2 M sodium chloride, 
and the DMA recovered in water after ethanol precipitation. The trp 
promoter-operator-containing gene with EcoRI sticky ends was identified 
in the procedure next described, which entails the insertion of fragments 
into a tetracycline sensitive plasmid which, upon promoter-operator 
insertion, becomes tetracycline resistant. 

Plasmid pBRHl {R.i. Rodriguez, et al., Nucleic Acids Research 6, 
3267-3287 [1979]} expresses ampicilin resistance and contains the gene 
for tetracycline resistance but, there being no associated promoter, does 
not express that resistance. The plasmid is accordingly tetracycline 
sensitive. By introducing a promoter-operator system in the EcoRI site, 
the plasmid can be made tetracycline resistant. 
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pBRHl was digested with EcoRl and the enzyme removed by phenol 
extraction followed by chloroform extraction and recovered in water after 
ethanol precipitation. The resulting DNA molecule was, in separate 
reaction mixtures, combined with each of the three DMA fragments obtained 
above and ligated with DNA ligase as previously described. The DNA 
present in the reaction mixture was used to transform competent E. colt 
K-12 strain 294, K. Backroan "et aT ., Proc Hat'l Acad Sci USA 73 , 4174-4198- 
[1976]) by standard technicjues {V. Hershfield et aK, Proc Nat'l Acad Sci 
USA.71_. 3455-3459 [1974]} and the bacteria plated on LB plates containing 
20 ug/ml ampicillin and 5 pg/ml tetracycline. Several tetracycline- • 
resistant colonies were selected, plasmid DNA Isolated and the presence 
of the desired fragment confirmed by restriction enzyme analysis. The 
resulting plasmid is designated pBRHtrp.. 

' An EcoRI and BamHI digestion product of the viral genome of hepatitis 
B was obtained by conventional means and cloned into the EcoRI and Bamlll 
sites of plasmid pGH6 {D.V. Goeddel et aj,., Nature 281. 544 [1979])) to 
form the plasmid pHS32. Plasmid pHS32 was cleaved with Xbal, phenol 
extracted, chloroform extracted and ethanol precipitated. It was then 
treated with'l nl E. coll polymerase I, Klenow fragment (Boehringer- 
Hannheim) in 30 til . polymerase buffer {50 mM potassium phosphate pH 7.4, 
7mH HgCl 2 , 1 mM s-mercaptoethanol ) containing O.lmM dTTP and O.lmM dCTP 
for 30 minutes at 0*C then 2 hr. at 37*C. This treatment causes 2 of the 
4 nucleotides complementary to the 5' protruding end of the Xbal cleavage 
site to be filled in: 

5' CTAGA 5' CTAGA 

3' T 3' TCT 

Two nucleotides, dC and dT, were Incorporated giving an end with two 
5' protruding nucleotides. This linear residue of plasmid pHS32 (after 
phenol and chloroform extraction and recovery in water after ethanol 
precipitation) was cleaved with EcoRI. The large plasmid. fragment was 
separated from the smaller EcoRI-Xbal- fragment by PAGE and isolated after 
electroelution. This DNA fragment from pHS32 (0.2 vg), was ligated, - 
under conditions similar to those described above, to the EcoRI-Taq I 
fragment of the tryptophan operon ("0.01 |ig), derived from pBRHtrp. 

In the process. of Hgating the fragment from pHS32 to the Eco RI-Taq 
I fragment, as described above, the Taq I protruding end is ligated to 
the Xbal remaining protruding end even though it is not completely 
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Watson-Crick base-paired: 

— T + • CTAGA TCTAGA 

AGC TCT — > AGCTCT 

A portion of this ligation reaction mixture was transformed into E. 
coli-294 cells, heat treated and plated on LO plates containing 
araplciTTin. Twenty-four colonies were selected, grown in 3 ml LB media, 
and plasmid isolated. Six of these were found to have the Xbal site 
regenerated via E. coli catalyzed DMA repair and replication: 

TCTAGA TCTAGA 

AGCTCT * AGATCT; — 

These plasraids were also found to cleave both with EcoRI and Hpal and 
to give the expected restriction fragments. One plasmid, designated pTrp 
14, was used for expression of heterologous polypeptides, as next 
discussed. 

The plasaid pHGH 107 (0.V. Goeddel et al, Mature , 281 , 544, 1979) 
contains a gene for human growth hormone wade up of 23 amino acid codons 
produced from synthetic DNA fragments and 163 amino acid codons obtained 
from complementary DNA produced via reverse transcription of human growth 
hormone messenger RNA. This gene, though it lacks the codons of the 
"pre" sequence of human growth hormone, does contain an ATG translation 
initiation codon. The gene vras Isolated from 10 ug pHGH 107 after 
treatment with EcoRI followed by E. coli polymerase I Klenow fragment and 
dTTP and dATP as described above. Following phenol and chloroform 
extraction and ethanol precipitation the plasmid was treated with BamHI. 

The human growth hormone ("HGH") gene-containing fragment was 
Isolated by PAGE followed by electroelutlon. The resulting DMA fragment 
also contains the first 350 nucleotides of the tetracycline- resistance 
structural gene, but lacks the tetracyline promoter-operator system so 
that, when subsequently cloned into an expression plasmid, plasraids " 
containing the insert can be located by the restoration of tetracycline 
resistance. Because the EcoRI end of the fragment has been filled in by 
the Klenow polymerase I procedure, the fragment has one blunt and one 
sticky end, ensuring proper orientation when later inserted into an 
expression plasmid. 

The expression plasmid pTrp!4 was next prepared to receive the HGH 
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gene-containing fragment prepared above. Thus, pTrpl4 was Xbal digested 
and the resulting sticky ends filled in with the Klenow polymerase I 
procedure employing dATP, dTTP, dGTP and dCTP. After phenol and 
chloroform extraction and ethanol precipitation the resulting DMA was 
treated with BaraHI and the resulting large plasmid fragment isolated by 
PAGE and electroelution. The pTrpl4-derived fragment had one blunt and 
one"sticky end, permitting recombination in proper orientation with the . 
HGH gene containing fragment previously described. 

The HGH gene fragment and the pTrp!4 4Xba-BamHI fragment were 
combined and ligated together under conditions similar to those described 
above. The filled in Xbal a'nd EcoRI ends ligated together by blunt end 
ligation to recreate both the Xbal and the EcoRI site: 

Xbal filled in EcoRI filled in HGH gene initiation 

TCTAG 

AGATC 



This construction also recreates the tetracycline resistance gene. 
Since the plasmid pHGH 107 expresses, tetracycline resistance from a 
promoter lying upstream from the HGH gene {the lac promoter), this 
construction, designated pHGH 207, permits expression of the gene for 
tetracycline resistance under the control of the tryptophan 
promoter -opera tor. Thus the ligation mixture was transformed into E. 
coli 294 and colonies selected on LB plates containing 5 ng/ml 
tetracycline. 

Plasmid pHGH 207 was EcoRI digested and the trp promoter containing 
EcoRI fragment recovered by PAGE followed by electroelution. Plasmid 
pBRHl was EcoRI digested and the cleaved ends treated with bacterial 
alkaline phosphatase ("BAP") (l»g, in 50 mH tris pH 8 and 10 mM MgCl 2 
for 30 rain, at 65*C) to remove the phosphate groups on the protruding • 
EcoRI ends. Excess bacterial alkaline phosphatase was removed by phenol 
extraction, chloroform extraction and ethanol precipitation. The 
resulting linear DNA, because It lacks phosphates on the protruding ends 
thereof, will in Ligation accept only inserts whose complementary stick 
ends are phosphorylated but will not itself recircularize, permitting 
more facile screening for plasmids containing the inserts. 
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The EcoRI fragment derived from pHGH 207 and the linear DMA obtained 
from pBRHl were combined in the presence of ligase as previously 
described and ligated. A portion of the resulting mixture was 
transformed into E. coli strain 294 as previously described, plated on LB 
media containing 5 »g/ml of tetracycline, and 12 tetracycline resistant 
colonies selected. Plasmid was isolated from each colony and examined 
for the presence of a DMA insert by restriction endonuclease analysis 
employing EcoRI and Xbal. 4 One plasmid containing the insert was 
designated pHKYl . 

The plasmid pHKYlO, described above, is a derivative of pBR322 which . 
contains a Bgl_ II site between the tetracycline resistance (Tc R ) 
promoter and structural gene (32). The large D)%'A fragment isolated after 
digesting pHKYlO with Pst I and Bgl_ II therefore contains part of the 
ampicill in resistance (Ap_ R ) gene and all of the Tc R structural gene, 
but lacks the TC R promoter (Fig 6). The plasmid pGH6 (21) was digested 
with Eco RI, the resulting single stranded ends were filled 1n with DNA 
polymerase I, and the plasmid was cleaved with Pst I. The small 
fragment, containing part of the Ag* gene, a double lac promoter and 
lac ribosome binding site, but lacking an ATG initiation triplet was 
Isolated. A similar _tr£ promoter fragment, containing the trp_ leader 
ribosome binding site, but lacking an ATG sequence (30), may be Isolated 
from pHKYl described above; see (32) (see Figure 6). 

The trp_ fragment just referred to Is an analog of the E. coll 
tryptophan operon from which the so-called trp_ attenuator has been 
deleted, See J. Bact. 133, 1457 (1978), to controllably heighten 
expression levels. Expression plasmids containing the modified trp - 
regulon can be grown to predetermined levels In nutrient media containing 
additive tryptophan in quantities sufficient to repress the promoter- 
operator system, then be deprived of tryptophan so as to dVrepress the 
system and occasion the expression of the intended product. 

The expression plasmids may be assembled via three part ligation 
reactions as shown in Figure 6. 15 ng {".05 pmole) of the assembled FIF 
gene (504 or 505 bp), 0.5 ng ("0.2 pmole) of the large Pst I - Bgl_ II 
fragment of pHKYlO and 0.2 pg ("0.3 pmole) of the appropriate promoter 
fragment were ligated and the mixture used to transform E. coli 294 (22). 
Plasmid DNA was prepared from individual transfonnants and analyzed by 
restriction mapping. Correct joining of the assembled gene to the 
promoter fragment should restore the Eco RI (lac ) or the Xba I (trp ) 
recognition sequences. The majority of the plasmids gave the expected 
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restriction enzyme digestion patterns. Individual clones (12 containing 
the trp promoter and 12 containing the lac promoter) were grown and 
extracts prepared for interferon assay as described in Materials and 
Methods. 

When assayed on human amnion (WISH) cells for antiviral activity by 
the CPE inhibition assay (1) five of the trp transforroants were positive 
(each approximately equivalent); eleven of. the lac transforraants gave 
equivalent IF activities. Therefore, one transformant from each series 
(pFIF)ac9 and pFIFtrp69) was selected for further study (Table 1). DMA 
sequence analysis demonstrated that the desired attachment of promoter to 
FIF structural gene had occurred In both cases.' 



Table I . Interferon activity In extracts of E. coll 



E. coli K-12 
strain 294 
transformed by: 


Cell density 
(cells/ml) . 


IF Activity 
(units/1 culture) 


FIF molecules 
per cell 


pBR322 


3.5 X 108 






pFIF1ac9' 


3.5 x 108 


9.0 x 106 


2.250 


pFIFtrp69 


3.5 x 108 


1.8 x 10 7 


4,500 


pFIFtrp369 


3.5 x 108 


8.1 x 10? 


20,200 



Cells were grown and extracts prepared as described in Materials and. 
Methods. The human amnion (WISH) cell line was used for the CPE 
inhibition assay Tl). Activities given are the average for three * 
independent experiments. To determine the number of IF molecules per 
cell a FIF specific activity of 4 x 10 8 units/mg was used (2). 



The amounts of fibroblast interferon produced by pFIFlac9 and 
pFIFtrj>69 are shown In Table 1. The trp_ promoter gave a FIF. expression 
level measurably higher than did the -lac promoter. In an attempt to 
further increase FIF expression levels, pFI Ftrp6 9 was" cleaved with Eco RI 
and two 300 base pair Eco RI fragments containing the trp promoter (30) 
were inserted. The resulting plasmid, pFIFtrp 3 69, contains three 
successive trp_ promoters which read toward the FIF gene. The amount of 
FIF synthesized by E. coli K-12 strain 294/pFIF trp 3 69 is 4-5 times 
that produced by pFIF trp_ 69 (Table 1). This is apparently due to the 
derepression of the trp promoter which occurs when trp repressor levels 
are titrated by the multiple copies of the Jtrp_ operator. 

The FIF produced by E. coli K-12 strain 294/pFIFtrp69 behaves like 
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authentic human FIF. As shown in Table 2, its antiviral activity is 
about 30 tiroes greater on human cells than bovine cells. In addition, 
the bacterially produced FIF is stable to treatment at pH Z overnight and 
is not neutralized by rabbit antihuman leukocyte interferon antibodies 
{Table 3). 

Table 2 . Interferon activities measured on different cell types " 





Interferon Activity (units/ml) 


Cells 


LeIF 


FIF 


E. coli K-12 strain 
B47pF7Ftrp69 extract 


Human amnion 

) 


20,000 


10,000 " 


1280 


Bovine kidney 


13,000 


400 


40 



10,000 units/ml respectively. Assays were performed as described in 
Materials and Methods. . 

Table 3 . Comparison of activities of extracts from E. coli K-12 

strain 294/pFIFtrp69 with standard human leukocyte and 
fibroblast interferons 

j nterferon Activity {units/mTT 



E. coli K-12 strain 
LeIF . FIF 2947pTlFtrp69 



untreated 1000 1000 1000 • 

pH2 •. 1000 ' 1000 1000 

rabbit antihuman <16 1000 1000 

LeIF antibodies 

Experimental procedures described in Materials and methods. Assayed by 
CPE inhibition using WISH cells/Sindbis virus. 

Purification \ 

The purification procedure for bacterial derived fibroblast is as 
follows: 

1. Frozen cells are suspended in tvfelve times volume per weight with 
sucrose lysis buffer {lOCbiH Tris-HCl, 10 percent sucrose, 0.2H NaCl, 
50mM EDTA, G.2mH PHSF, pH 7.9} containing lysozyme at Img ml" 1 . 
The ceil suspension is stirred for 1 hour at 4*C and centrifuged. 
Fibroblast interferon activity remains in the supernatant. 
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2. Pol/othyleneimine (5 percent v/v) is added to the sonicated 
supernatant to a final concentration of 0.5 percent (v/v). The 
solution is stirred for 1 hour at 4'C and centrifuged. Interferon 
activity remains in the supernatant. 

3. Solid ammonium sulfate is added to the polyethyleneimine supernatant 
to a final concentration of SO percent saturation, stirred for <30 
minutes at 4*C and centrifuged. Interferon activity is in the 50 
percent pellet. 

4. The 50 percent ammonium sulfate pellet is suspended in one half the 
volume of the 50 percent "ammonium sulfate suspension with Phosphate 
Buffered Saline (20 mM sodium phosphate 0.15M NaCl, pH 7.4). 
Polyethylene glycol 6000 (50 percent w/v in PBS) is added to a final 

• concentration of 12 1/2 percent (v/v)," stirred at 4*C for 2 hours and 
centrifuged. Interferon activity is in the pellet. The pellet is 
suspended in a minimal volume of sucrose lysis buffer and clarified 
by centrifugation. 

This initial extraction procedure results in a purification of 
fibroblast interferon from 0.001 percent of the total protein to 0.05 
percent of the total protein. This material can be further purified 
to homogeneity by the following cplumn chromatography steps: 

5. Afinity chromatography on Amicon Blue B in sucrose lysis buffer. 

6. Anion exchange chromatography on QAE Sephadex in sucrose lysis buffer 
in the absence of 0.2H NaCl. . 

7. Size exclusion chromatography on Sephadex G-75 in sucrose lysis 
buffer. ; \ 

8. Reverse phase high pressure 11 quid" chromatography. 
Parenteral Administration 



FIF may be parenterally administered to subjects requiring antitumor 
or antiviral treatment. Dosage and dose rate may parallel that currently 
in use in clinical .investigations of human derived materials, e.g., about 
(1-10) x 10 6 units daily, and in the case of materials of purity 
greater than 1 percentage, likely up to, e.g., 150 x 10 6 units daily. 
Dosagesof bacteriaTly obtained FIF could be significantly elevated for 
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greater effect owning to the essential absence of human proteins other 
than FIF, which proteins in ffbroblast-derived materials- may act as 
pyrogens, exhibiting adverse effects, e.g., malaise, temperature 
elevation, etc. 

As one example of an appropriate dosage fom for essentially 
homoaeneous bacterial FIF in parenteral form, 3 rag. FIF of specific 
activity of, say, 2 x 10 8 tymg may be dissolved in 25 ml. 5 percentage 
serum albumin (human) - USP, the solution passed through a bacteriological 
filter and the filtered solution aspetically subdivided into 100 vials, 
each containing 6 x 10 6 units pure interferon suitable for parenteral 
administration. The vials are preferably stored in the cold {-20°C) 
prior to use. 

The compounds of the present invention can be formulated according to 
known methods to prepare pharmaceutically useful compositions, whereby 
the polypeptide hereof is combined in admixture with a pharmaceutical ly 
acceptable carrier vehicle. Suitable vehicles and their formulation are 
described in Remington's Pharmaceutical Sciences by E.W. Martin, which is 
hereby incorporated by reference. Such compositions will contain an 
effective amount of the interferon protein hereof together with a 
suitable amount of vehicle in order to prepare pharraaceutically 
acceptable compositions suitablefor effective administration to tile 
host. One preferred mode of administration is parenteral. 
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1. A polypeptide comprising the amiwac'id sequence of a mature human^^/ 
fibroblast interferon, microbiauyvprodueed and unaccompanied by any 
corresponding presequence or^orfion thereof. 

2. .A polypeptide according tgJcjUira 1, unaccompanied by associated 
glycosylation. '* \ Ja3& 

3. The polypeptide according to Claim_l, optionally containing the amino 
acid methionine as the ordinarily first amino acid of said inte/reron. 

4. The polypeptide according to Claim_l, optionally containino/a 
cleavable conjugate or microbial signal protein attached/to the 
N-terainus of the ordinarily first amino acid of said/fnterferon. 



5. A ONA sequence comprising a sequence coding for ine polypeptide 
according to Claims 1, 3 orJU— — / 53<~ 

6. The DMA sequence according to Claim 5 oaefably linked with a DNA 
sequence capable of effecting mi crobia/i expression of a polypeptide 
according to Claims 1, _3_ox-A. 

7. A replicable microbial express ionAehicle capable, in a transfonnant 
microorganism, of expressing/a polypeptide according to .Claims I, 3 
or 4. / • ' ^"43C 



.8. A microorganism transformed with the vehicle- according to Claiin_ZA_ 

'. ■ 

9. The microorganism according to Claim J ^-obtained by transforming an *?>3 
E. coli strain. 



10. A plasmid selected from the group consisting of pFlF lac 9, pFlF trp 
69, and pFIi/trp 3 69. 



11. A transformed microorganism selected from the group consisting of an 
JE. coli/ strain transformed independently with each of the plasmids 
according to Claira-10. 

12. A transformed microorganism according to Clai m 1 In here in said 
U coll strain is E. coli K-12 strain 294. 
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13. A composition of matter comprising a therapeutically active fraction 
.of a polypeptide consisting essentjially of the amino acid sequence of 
a nature human fibroblast interferon, the balance of said composition 
comprising soluble microbial protein from which said polypeptide may 
be purified to a degree sufficient for effective therapeutic 
application. 

14. A bacterial extract comprising greater than about 95 percent pure 
polypeptide consisting essentially ofi the amino acid sequence of a 
mature fibroblast interferon according to Claims 1, 2, 3, or 4. 



15. A composition comprising a therapeutically effective amount of a 



mature human fibroblast interferon aco 
suitable for pharmaceutical adminis^r^t 



16. The composition according to Claim 15 
■ administration. 



siitable for parenteral 



17. A culture of microbial cells capable or 
interferon in mature form. 



to Claims 1, 3 or 4, 



iroducing a human fibroblast 



18. The use of a mature human fibroblast inte-feron according to Claims 
•I, 3 or 4, for antitumor or antiviral treatment or for preparing 
pharmaceutical compositions useful for sucn treatment. 



19. A process which comprises microbial ly expressing a human ffbcefflast 

Interferon in mature form. t-fi)^ 



20. A process which comprises microbially express 
according to Claims 1, 3 or,4-. 



a polypeptide 



according to Claims l.».jjjpjL 



21. A process for producing a polyp 

comprising causing a cuHjspeof a microorganism, transformed with a 
replicable microbial expression vehicle capable of expressing said 
polypeptide, to gpotf up and effect expression of said polypeptide and 
recovering saJ4^polypeptide. 



22. 



A process for producing an expression vehicle according to Claim 7 
comprising contracting a first DMJL-sequence coding for said 
polypeptide (defined in Claim 7) abdoperably linking said first ONA 
sequence with a second DMA sequencWcapable of effecting microbial 
expression of said first DMA sequende. 
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ABSTRACT 



A cDHA library is constructed using mRHA from human fibroblasts 
induced with poly(I):po1y{C). A bacterial clone contaning fibroblast 
interferon cDNA sequences identified by hybridization to a cDNA probe 
synthesized using deoxyoligonucleotide primers which hybridize to 
fibroblast interferon mRHA specifically. Expression plasmids are 
constructed which permit the synthesis in E. coli of 8 x 10 7 units of 
human fibroblast interferon per liter of culture. The bacterially 
produced fibroblast inteferon is indistinguishable from authentic human 
fibroblast interferon by several critieria. 
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Wilmington, Delaware 19898 
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California Institute of Technology 
Pasadena, California 91125 



The chemical characterization of interferons is necessary in order to under- 
stand its actions at the molecular level. Characterization has been slow since only 
microgram quantities have been available of any interferon. Interferons from 
human fibroblast cells, 1 ' 2 human lymphoblastoid cells, 3 human leukocytes, and 
mouse cells 5 " 8 have recently been purified to homogeneity and some limited 
structural information has been reported. W0 A major goal of the structural studies 
in interferons is the determination and comparison of amino acid sequences. 

We report in this paper an improved procedure for the purification of human 
fibroblast interferon (HF-IF) that has been used to provide interferon for amino 
acid sequencing. Using an automated protein microsequencmg technique 
previously described," 12 we have determined the sequence of the 13 ammo acid 
residues at the N-terminus of the HF-IF prepared by this method. We also report 
an amino acid composition. 

Human diploid fibroblast cells (FS-4) were cultured and HF-IF was produced 
as previously described. 1 Interferon was assayed by a microassay technique, 
with vesicular stomatitis virus as the challenge virus. Interferon units are given 
in National Institutes of Health human fibroblast interferon units. 

The crude HF-IF, 10-15 liters produced in the absence of serum, is made 1 M 
in NaCl and passed at room temperature through a column (4 x 10 cm) of Blue 
Sepharose (Pharmacia, Inc.) equilibrated with 0.02 sodium phosphate buffer, pH 
7.2, containing 1 M NaCl. The HF-IF is retained while greater than 95% of the 
total protein passes through the column. The interferon is eluted with a 1:1 
mixture of the column buffer and ethylene glycol, and each fraction is diluted 
immediately with 0.5 volume of the buffer (Figure la). Fractions- containing 
interferon activity are pooled, diluted with two volumes of the column buffer, 
and passed through a small (1x6 cm) Blue Sepharose column for concentration. 
The interferon is eluted as described above (Figure lb). 

Fractions containing HF-IF are pooled, dialyzed against 1 mM Tns-HCl, pH 
6.8, containing 0.02% sodium dodecyl sulfate (SDS, BioRad electrophoresis 
grade), and concentrated to dryness in a vacuum centrifuge. The HF-IF is then 
subjected to electrophoresis on a SDS-polyacrylamide slab gel and the HF-IF 
eluted as previously described 1 (Figure 2a). Fractions eluted from the gel are 
assayed for interferon activity (Figure 2a). Approximately 0.2 M g of HF-IF from 
the peak activity fraction is subjected to electrophoresis in this system again, and 
the gel is stained with Coomassie Blue (Figure 2b). 

385 

0077-8923/80/0350-0385 $01.75/0 © 1980. NYAS 



Board Assigned Page #2171 



Sugano EXHIBIT 2053 

Sugano v. Goeddel 

Interference No. 105,334 and 105,337 



386 



Annals New York Academy of Sciences 



3X10 6 




12 16 4 
FRACTION No. 

Figure 1. fa) Fractionation of crude interferon on a large column of Blue Sepharose 
Elution of interferon with 50% ethylene glycol in column buffer begins at fraction 1. 
(b) Small Blue Sepharose column. Fractions 7-17 in (a) were pooled, passed through the 
small column, and eluted with 50% ethylene glycol in column buffer (fractions 1-20) 




12 16 
FRACTION No. 



vl ?Z ,1 Pre P a , ra ! ,ve electrophoresis of interferon, activity profile. Fractions 6-15 in 
Figure lb were pooled concentrated, and subjected to electrophoresis in a polyacrylamide 
slab gel. 075 mm thick. Fractions 14 and 15 {Figure 2a) were pooled and processed for 
ammo acid sequencing. 
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Figure 2b. Polyacrylamide slab gel, staining of proteins eluted from preparative gel in 
Figure 2a. Approximately 2% of the protein in fractions 14 and 15 (Figure 2a) was 
subjected to electrophoresis and stained. 



The preparative electrophoresis fractions containing HF-IF are pooled and 
centrifuged for 30 min at 30,000 rpm, 4°C, to remove polyacrylamide gel 
particles. The interferon solution is dialyzed first against 0.15 M NaCl containing 
0.1% SDS and then against 0.02% SDS, and it is concentrated to dryness in a 
vacuum centrifuge. 

This purification procedure is simpler and shorter than that described 
previously. 1 Recoveries from the large Blue Sepharose column have ranged from 
50-100%, and those from the small column approach 100%. The interferon 
(5 x 10 7 units/mg) eluted from these columns is stable for at least four weeks at 
4°C in 1 M NaCl, 35% ethylene glycol, pH 7.2. Recoveries of activity from the 
SDS gels have ranged from 5-20%, and specific activities of this protein have 
ranged from 2-9 x 10 8 units/mg. Accurate specific activities are difficult to 
determine, and two- to fourfold differences above 1 x 10 8 units/mg are probably 



Board Assigned Page #2173 



388 Annals New York Academy of Sciences 

Table 1 



Amino Acid Composition of Human Fibroblast Interferon 





Mol % 


Residues per 
20,000 Daltons 


Asp 


11.1 


18.9 


Thr 


4.0 


6.8 


Ser 


6.2 


10.5 


Glu 


15.9 


27.0 


Pro 


1.6 


2.7 


Gly* 


4.6 


7.8 


Ala 


5.9 


10.0 


Cysf 


1.0 


1.7 


Val 


3.5 


6.0 


Met 


1.7 


2.9 


lie 


5.3 


9.0 


Leu 


12.0 


20.4 


Tyr 


4.4 


7.5 


Phe 


5.5 


9.4 


His 


2.9 


4.9 


Lys 


6.8 


11.6 


Arg 


6.4 


10.9 


Trpt 


0.6 


1.0 



"Includes correction for free glycine present in unhydrolyzed protein. 

fDetermined after performic acid oxidation. 

^Determined after hydrolysis with mercaptoethane-sulfonic acid. 



not meaningful. Overall yields of purified HF-IF from 10-15 liter batches of 
crude material (5-7 x 10 7 total units, 8 x 10" units/mg) have averaged around 10 
percent. This gives 5-20 fig of homogeneous interferon. 

Amino acid analysis on 1-2 ng aliquots was performed on a Durrum D-500 
ammo acid analyzer (Table 1). Automated Edman degradation on 0.4-2 fig 
aliquots of the purified interferon was performed in a spinning cup sequentator 
as previously described."- 12 Phenylthiohydantoin (Pth) amino acids were identi- 
fied by high performance liquid chromatography on a Du Pont Zorbax GN 
column." 

The sequence of the N-terminal 13 amino acid residues of human fibroblast 
interferon determined by this microsequencing technique is given in Figure 3. 
Yields of Pth-Met at cycle 1 for three sequenator runs ranged from 60-100% 
(based upon protein quantitation by amino acid analysis), and the sequenator 
repetitive cycle yields were 92-95 percent. Any unblocked minor peptide 
sequence present at >5% of the reported sequence could have been detected by 
the methods used, but none has been observed. This result, coupled with the high 
initial Pth yields, confirms the homogeneity of the HF-IF polypeptide prepara- 



1 b n 

Met - Ser - Tyr - Asn - Leu - Leu - Gly - Phe - Leu - Gin - Arg - Ser - Ser 



Figure 3. The N-terminal amino acid sequence of human fibroblast interferon. 
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This paper reports the amino acid composition and the N-terminal sequence 
of 13 amino acid residues of human fibroblast interferon. 15 It is essential to 
determine the amino acid sequences of the interferons in order to identify an 
active site and to understand at the molecular level its mechanism of action. 
Moreover, determination of the amino acid sequences will allow comparison of 
structural features of interferons from different sources. This is an interesting 
possibility since interferons from different species and even from different cell 
types within an animal are thought to be different proteins. 
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ABSTRACT By techniques of recombination in vitro, we 
have constructed a plasmid bearing the repressor gene (cl) of 
bacteriophage X fused to the promoter of the lac operon. Strains 
carrying this plasmid overproduce X repressor. This functional 
cl gene was reconstituted by joining DNA fragments bearing 
different parts of that gene. Flush end fusion techniques, in- 
volving no sequence overlap, were necessary for the construc- 
tion; in certain cases, the abutting of the DNA molecules bearing 
ends generated by different restriction endonucleases creates 
a sequence at the junction which is recognized by one of the 
restriction endonucleases. 

We have constructed a plasmid in which the promoter of the 
lac operon has been placed adjacent to the repressor gene (cl) 
of bacteriophage X, by techniques of recombination in vitro. 
Escherichia coli strains carrying this plasmid overproduce re- 
pressor because transcription of cl originates mainly at the lac 
promoters and because each cell contains multiple copies of the 
plasmid. Certain novel aspects of our construction may be ap- 
plied to the construction in vitro of other hybrid operons. Other 
X repressor overproducing strains have been created by Gro- 
nenborn and Muller-Hill (1), who have isolated lac promoter-cl 
fusions in vivo. 

MATERIALS AND METHODS 

E. coli K12 strain 294 (endo I - , Bi~, rg - , niK + ; obtained from 
M. Meselson) was the host used in most of these experiments. 
A lac repressor overproducing strain (2), V2000 [F' pro lac 
,-Qi2- v n8 / A(lac-pro) Sm R (Xtmm 21 plac5)], was constructed 
and used in some experiments. The plasmid vectors pCRll (3) 
and pMB9 (4) were used. Plasmid DNA was prepared by the 
method of Clewell and Helinski (5). DNA fragments generated 
by restriction endonucleases were prepared by electrophoresis 
on polyacrylamide gels as described by Maniatis et al. (6). 
EcoRI, ffindlll, and Hoe III digestions were performed in Hin 
buffer (7) and Hpall digestions were performed in Hpa buffer 

(7) . Reactions using E. coli DNA polymerase I (DNA nucleo- 
tidyltransferase, deoxynucleosidetriphosphate:DNA deoxy- 
nucleotidyltransferase, EC 2.7.7.7) (a gift of W. McClure) were 
performed in 50 mM Tris-HCl at pH 7.8, 5 mM MgCl 2 , 1 mM 
2-mercaptoethanol, 50 Mg/ m l of bovine serum albumin plus 
2 jtM each of dATP, dCTP, dGTP, and dTTP for 1 hr at 15° 

(8) . T4 polynucleotide ligase [polynucleotide synthetase, po- 
ly(deoxyribonucleotide):poly(deoxyribonucleotide) lig- 
ase(AMP-forming), EC 6.5.1.1] reactions were performed in 
6.6 mM Tris-HCl at pH 7.4, 6.6 mM MgCl 2 , 6.6 mM 2-mer- 
captoethanol, 100 /iM ATP at 4° ; DNA fragments were present 
at 100-300 Mg/ml. For joining DNA fragments bearing flush 
ends (9), 5-fold higher concentrations of ligase were used than 
were required to join an equivalent number of staggered ends, 
and the reactions were incubated at room temperature for 4 hr. 
Cells were transformed with DNA as described by Cohen et al. 
(10). Drug resistant transformants were selected on drug sup- 
plemented agar plates, and X immune transformants were se- 



lected on agar plates seeded with 10 9 XKH54 (11) and 10 9 
XKH54li8o; in some cases, both selections were performed si- 
multaneously. The immunity of plasmid containing strains was 
further tested by streaking single colonies across a streak of 
phage on an agar plate. Colonies of transformants which con- 
stittltively synthesized /3-galactosidase appeared blue on agar 
plates containing a chromogenic, noninducing substrate, 5- 
chloro-4-bromo-3-indolyl-(8-D-galactoside (40jug/ml). Assays 
of X repressor (12), 0-galactosidase (/3-D-galactoside galac- 
tohydrolase, EC 3.2.1.23) (13), and the isolation of operator 
containing DNA fragments on nitrocellulose filters (14) have 
been described previously. Purified X repressor was a gift of R. 
Sauer and purified lac repressor was a gift of A. Maxam. Ex- 
periments were carried out in a PI (EK1) facility. 

RESULTS 
Preliminary considerations 

We wished to proceed in two steps; first, to clone cl flanked by 
as little extraneous phage DNA as possible on a plasmid and 
then to insert a DNA fragment bearing the lac promoter near 
the beginning of cl. Two problems arose. No single restriction 
endonuclease cleaves X DNA just outside the ends of cl without 
also cleaving within it, and no restriction endonuclease site 
suitable for inserting the lac promoter exists in X DNA near the 
beginning of cl. We adopted a strategy based on the following 
considerations: gene cl can be neatly isolated on two DNA 
fragments, one of which bears two Hindlll ends and the other 
of which bears one Hindlll end and one Haelll end (see Fig. 
1). Proper joining of these fragments reconstitutes cl on a larger 
fragment bearing one Hindlll end and one Haelll end. Hin- 
dlll ends are staggered and readily anneal to each other, 
whereas Haelll ends are flush. Staggered ends which anneal 
to each other can be joined by T4 polynucleotide ligase (15), 
and flush ends can be joined to other flush ends by that enzyme 
(9). To clone our cl fragment, we sought a plasmid which could 
be opened so as to produce one Hindlll end and one flush end. 
The plasmids we used, pCRll and pMB9, each have a single 
Hindlll site and a single EcoRI site. Although EcoRI produces 
staggered ends, we anticipated that these ends could be con- 
verted to flush ends by treatment with DNA polymerase I and 
the four deoxyribonucleotide triphosphates, because the re- 
cessed 3' end can be extended by copying the protruding 5' end 
of the complementary strand (a process we hereafter refer to 
as f illing-in). Precise joining of a filled-in EcoRI end to a Haelll 
end should produce a molecule recognized by EcoRI at the 
junction (see Fig. 2). This regenerated EcoRI site near the be- 
ginning of cl could then be used as a site to insert the lac pro- 
moter. 

The plasmids pCRll and pMB9 are derived from Col El, 
which is normally present in about twenty copies per cell (16). 
pCRll and pMB9 carry drug resistance determinants (kan- 
amycin and tetracycline, respectively). 
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Fig. 1. Restriction endonuclease sites in the vicinity of the cl gene 
of \. Haelll cleavage sites are indicated above, and Hindlll sites 
below, the map of the region including and flanking cl. (The extreme 
right ifaelll site is also indicated below the line.) Distances in base 
pairs between various cleavage sites are given. The extent of cl is in- 
dicated by the heavy line. The extreme right Haelll site is in the cro 
gene, and the extreme'left Hindlll site is in the gene which lies to the 
left of cl, rex. The arrow indicates the direction of transcription of 
cl. 



Cloning cl 

With these considerations in mind, we proceeded as follows (see 
Fig. 2): plasmid pCRll DNA was digested with Eco RI and the 
protruding single-stranded ends converted to double-stranded 
flush ends by filling-in with DNA polymerase I. The product 
was then digested with Hindlll to produce a vector DNA 
molecule bearing one Hindlll end and one flush end. This 
procedure removes at least part of the kanamycin resistance 
gene(s). Two DNA fragments bearing portions of cl were 
produced as follows (see Fig. 1): X DNA was digested with 
Haelll, and a 790 base pair fragment was isolated that carries 
the right portion of cl plus about 150 base pairs to the right of 
cl. This fragment was further digested with Hindlll to produce 
a fragment 630 base pairs long. Separately, X DNA was digested 
with Hindlll and a 520 base pair fragment bearing the left 
portion of cl was isolated. Together these two fragments span 
the entire cl gene. Gene cl was then simultaneously reconsti- 
tuted and inserted in the plasmid by treating a mixture of the 
specially prepared plasmid and the fragments which span cl 
with T4 polynucleotide ligase (see Fig. 2). Five X immune clones 
were isolated from bacteria transformed with the products of 
this reaction, and their plasmids were analyzed by restriction 
endonuclease digestion. One isolate, pKB155, produced three 
fragments (5000, 630, and 520 base pairs long) when cleaved 
with EcoRI and Hindlll, as predicted by the construction 
shown in Fig. 2 (E and F). In particular, the fused Haelll- 
EcoRI junction to the right of cl was cleaved again by 
EcoRI. 

The cl gene was transferred from pKB155, which bears no 
drug resistance marker, to the plasmid pMB9, which carries a 
tetracycline resistance gene, as follows: an 1150 base pair piece 
of DNA bearing the entire cl gene was excised from pKB155 
by complete digestion with EcoRI and partial digestion with 
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FIG. 2. Cloning the cl gene. Gene cl, carried on two DNA frag- 
ments, was inserted into the plasmid pCRll as diagrammed (see text). 
The plasmid pCRll (A) was digested with BcoRI (B), and the pro- 
truding 5' ends were filled in (see text) with DNA polymerase I (pol 
I) (C); the product was digested further with Hindlll (D). The Haelll 
790 base pair fragment digested with Hindlll and the Hindlll 520 
base pair fragment (see Fig. 1) were added to the prepared plasmid 
(E). No attempt was made to remove the extra pieces of DNA gener- 
ated by the ifmdlll digestions of the plasmid and of the Haelll 790 
fragment. The mixture was treated with polynucleotide ligase, and 
the desired recombinant (F) was selected as described in the text. 

Hindlll. Plasmid pMB9 was digested with EcoRI and Hindlll, 
which removes a nonessential 350 base pair fragment; the 1150 
base pair cl gene fragment was added, and the mixture was 
treated with ligase. Bacteria were transformed with the prod- 
ucts of ligation, and tetracycline resistant lambda immune 
clones were readily isolated. Restriction endonuclease analysis 
of several isolates (not shown), typical of which is pKB158, 
confirmed the structure shown in Fig. 3. 

Construction of a plasmid fusing the lac promoter to 
the cl gene 

A plasmid bearing a (nominally) 205 base pair fragment of 
DNA which carries the lac UV5 promoter-operator region was 
constructed by F. Fuller. He used a 203 base pair fragment of 
known sequence generated by Haelll. The Haelll ends of this 



fcoRI 




pMB9 pKB158 pKB252 

FIG 3. The structure of pMB9, pKB158, and pKB252. The Eco RI and Hindlll sites, the sizes of the cleavage products, and the location of 
cl in the various plasmids are shown. The location and orientation of the lac promoters in pKB252 are indicated. 
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FlG. 4. Structure of pKB252 in the lac promoter-cl gene region. 
The location of EcoRI and Hpall cleavage sites are shown, respec- 
tively, above and below the map of the 1550 base pair fragment pro- 
duced by Haelll digestion of pKB252. The locations of the lac and 
lambda operators on this fragment are indicated by the solid circles 
and solid box, respectively. Distances between various endonuclease 
sites and the position of the Hpall site in the lac promoter are given 
in base pairs. 

fragment were converted to EcoRI ends by joining the frag- 
ment to filled-in EcoRI ends on a plasmid by using the methods 
described above. Colonies harboring plasmids which carry the 
lac promoter-operator were identified by their constitutive 
synthesis of #-galactosidase, which renders them blue on agar 
plates containing 5-chloro-4-bromo-3-indolyl-/J-D-galactoside. 
The lac promoter fragment serves as a "portable promoter" 
with an EcoRI site 65 base pairs from the startpoint of tran- 
scription (corresponding to the ninth amino-acid residue of the 
lac z gene). This fragment was excised from its carrier with 
EcoRI and inserted near the beginning of cl at the single EcoRI 
site in pKB158. About one half of the tetracycline resistant 
clones which were isolated carried the lac promoter fragment 
on the plasmid, as manifested by their constitutive synthesis of 
/J-galactosidase. All of these clones were immune to Xvir (unlike 
the strain carrying pKB158) and sensitive to Ximm^cV in 
cross-streak tests, although many clones exhibited some cell 
death at the junction of the cross-streak with Xvir. Three clones 
which exhibited no apparent sensitivity to Xvir were selected 
for further study. Typical of the plasmids in these clones is 
pKB252. 

Structure of pKB252 

An EcoRI plus Hindlll digest of pKB252 yielded the 205 base 
pair promoter fragment as well as the fragments produced by 
a similar digestion of pKB158. When pKB252 was digested with 
Haelll, a 1550 base pair fragment was produced. This frag- 
ment, which extends from the Haelll site in cl to a Haelll site 
which is 350 base pairs beyond the EcoRI site in the plasmid 
DNA (17), was expected to contain the lac promoter. Analysis 
of this 1550 base pair fragment confirmed the structure of 
pKB252 as shown in Fig. 3. The products of Hpall digestion 
of this fragment were 205, 480, and 870 base pairs long (see the 
diagram of Fig. 4). Because the cl region of X does not have any 
Hpall sites (11), and the lac promoter fragment does have a 
Hpall site about 85 bases from one end, this pattern of frag- 
ments suggested the structure shown in Fig. 4. In particular, 
the 205 base pair product suggested the presence of multiple 
lac promoters. If only one promoter were present, then no such 
piece would have been produced. Moreover, all such promoters 
must have the same orientation; if any two promoter regions 
were oppositely oriented, then Hpall digestion would have 
produced a 160 or a 250 base pair fragment. 

The exact number of lac promoters in pKB252 was deter- 
mined as follows: uniformly 32 P labeled plasmid DNA was di- 
gested with Haelll and the 1550 base pair fragment was iso- 
lated. This fragment was digested with EcoRI and the products 
separated on a polyacrylamide gel. The resulting fragments 
(205, 360, and 790 base pairs) were cut from the gel and the 
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Table 1. The number of lac promoter fragments in pKB252 



Fragment size 


Radioactivity "P 




in base pairs 


counts per minute 


Stoichiometry 


790 


3110 


1 


360 


1345 


1 


205 


1350 


2 



Stoichiometry is calculated by comparing the ratios of Tradio- 
activity/fragment size. 



radioactivity in each fragment was determined (Table 1). The 
205 base pair fragment was present in twice molar quantities 
compared to the 360 and 790 base pair fragments which indi- 
cated that there are two lac promoters in pKB252. 

To prove that the lac promoters transcribe toward cl, we took 
advantage of the fact that Hpall cuts near the lac operator in 
the lac promoter. The Haelll 1550 fragment was digested with 
Hpall and the fragments produced were mixed with either lac 
or lambda repressor and passed through nitrocellulose filters. 
The fragments retained on the filters were eluted and analyzed 
on a polyacrylamide gel. It was found that the 870 base pair 
fragment was retained by both lac and X repressors, the 205 
base pair fragment was retained only by lac repressor, and the 
480 base pair fragment was not retained by either repressor. 
These results are consistent only with the orientation shown in 
Fig. 4 in which the lac promoters transcribe toward cl. 

Control of repressor synthesis in vivo 

Cell extracts of strains containing various plasmids were assayed 
for X repressor and, in some cases, an aliquot of the culture was 
assayed for /3-galactosidase. The results are listed in Table 2. 
Strains carrying plasmids bearing cl but not the lac promoter' 
(pKB155 and pKB158) contained about five times more re- 
pressor than did single lysogens. Strains carrying pKB252, which 
has two lac promoters, produced 35 times more repressor than 
did single lysogens. 

Synthesis of lambda repressor in pKB252 strains is regulated 
by lac repressor (Table 2). A strain bearing a wild-type lac op- 
eron does not make enough lac repressor to repress signif icandy 
the synthesis of X repressor from the lac promoters on pKB252 
(Table 2). When this plasmid was transferred to a strain which 



Table 2. Repressor synthesis in plasmid containing strains 




Repressor 


0-Galactosidase 


Strain 


level X 10 


level 


28(\) 


0.3 


N.D. 


294/pKB155 


1.0 


N.D. 


294/pKB158 


1.5 


3 


294/pKB252 


9.7 


2.8 X 10 3 


294/pKB252 (+IPTG) 


10.7 


2.8 X 10 3 


V2000/pKB252 


1.3 


10 


V2000/pKB252 (+IPTG) 


2.9 


1.9 X 10 3 



Various strains were grown overnight at 37° in M9 salts and glu- 
cose medium, and were assayed for lambda repressor and /3-galac- 
tosidase. The results are given in units of repressor per milligram of 
protein. One unit is the amount of repressor which gives half-maxi- 
mal binding in the nitrocellulose filter binding assay and cor- 
responds to approximately 0.5 ng of repressor. /3-Galactosidase re- 
sults are given in the units of Miller (13). A single lysogen, 28 (X), 
is included for comparison purposes. Strain 294 makes wild-type 
lac repressor levels, whereas strain V2000 over-produces lac repres- 
sor about 100-fold (2). ITPG, isopropyl-thiogalactoside. N.D., not 
determined. 
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Table 3. Use of the filling-in method with various 
restriction endonucleases 



Require- 

Restric- ment for Subsite 

tion endo- Recognition site regen- Recognition 

nuclease sequence eration Subsite sequence 



£coRI 


1 

GAATTC 
CTTAAG 

t 


5' C 


£coRI* 


I 

AATT 
TTAA 

t 


EcoRIl 


i 

CCAGG 
GGTCC 

t 


None 


Eco RII 


I 

CCAGG 
GGTCC 

t 


Hind III 


I 

AAGCTT 
TTCGAA 

t 


5' T 


Alul 


AGCT 
TCGA 
t 


Baml 


I 

GGATCC 
CCTAGG 

t 


5' C 


Mbol 


1 

GATC 
CTAG 
t 


Bglll 


i 

AGATCT 
TCTAGA 

t 


5' T 


Mbol 


i 

GATC 
CTAG 
t 



Several endonucleases which produce 5' protruding ends are listed 
with the DNA sequence which they recognize (20-24, R. J. Roberts, 
G. Wilson, and F. E. Young, submitted for publication). After filling 
in these ends with DNA polymerase I, the original recognition se- 
quence can be regenerated by joining the filled -in end to a flush ended 
DNA molecule which meets the requirement listed in the third col- 
umn. When joined to a flush ended fragment not meeting that re- 
quirement, the filled-in end generates a sub-specificity, which is de- 
• scribed in the fourth and fifth columns (20, 25, R. E. Gelinas, P. A. 
Myers, K. Murray, and R. J. Roberts, unpublished). The top line of 
each recognition sequence reads from the 5' to the 3' end; the bottom 
line reads in the opposite direction. 

overproduces lac repressor, the synthesis of X repressor was 
reduced to the basal (i.e., pKB158) level. IPTG (isopropyl-thio- 
galactoside), an inducer of the lac operon, partially induces 
/3-galactosidase synthesis in cells which overproduce lac re- 
pressor. Table 2 shows that isopropyl-thiogalactoside also par- 
tially induces X repressor synthesis when such cells carry 
pKB252. 

DISCUSSION 

We have described the use of techniques of DNA recombina- 
tion in vitro to construct a plasmid on which the X repressor 
gene (cl) is transcribed largely from the promoter of the lac 
operon. This plasmid has two copies of the lac promoter which 
transcribe toward cl. As predicted by this arrangement, the 
synthesis of X repressor in strains bearing this plasmid is regu- 
lated by the lac repressor. The lac promoter fragments in 
pKB252 contain the startpoint of translation of the lac z gene; 
however, several translational stop signals immediately precede 
cl (18), and repressor synthesized in strains carrying pKB252 
is not a fusion product initiated at the lac z translational start- 
point. Strains carrying this plasmid produce about 35 times 
mo;e repressor than do single lysogens, as measured by DNA 
binding activity; however, these levels are variable and are often 
as much as 3-fold higher. The source of this variation is not 
understood. These repressor levels are 10- to 25-fold lower than 
would be expected on the basis of the known strength of the lac 
promoter (19). This inefficient expression might occur because 
transcription beginning at the lac promoter attenuates fre- 



quently before transcribing cl, because the progress of RNA 
polymerase is blocked by X repressor bound to the operator just 
to the right of cl, or because the mRNA produced is ineffi- 
ciently translated. The level of repressor was not higher when 
plasmid pKB252 was carried by a strain bearing an SuA 
mutation. Nevertheless, the high levels of repressor made by 
strains carrying pKB252 have greatly facilitated structural 
studies of repressor. 

Strains carrying plasmids in which cl is transcribed from its 
own promoter produce about five times more repressor than 
does a single lysogen, even though these plasmids should be 
present in about 20 copies per cell (16). Gene cl is known to 
regulate its own synthesis (18), and these strains are exemplary 
of such autogenous regulation. 

The expression of tetracycline resistance in strains carrying 
pKB158 and pKB252 may be under the control of the promoter 
for cl (Prm) and the lac promoter. H. W. Boyer (personal 
communication) has found evidence suggesting that the Hin- 
dlll site in pMB9 is within the promoter of the tetracycline 
resistance gene. In our experiments, tetracycline resistant X 
immune clones were readily isolated which were missing the 
pMB9 specific material between the EcoRI and Hindlll sites; 
presumably the expression of tetracycline resistance in these 
cases results from an extension of the cl mRNA. 

Our construction required the use of new techniques which 
are generally applicable. We have assembled a hybrid operon 
from separate pieces; some junctions involved sequence overlap, 
some did not. Furthermore, the DNA polymerase I filling-in 
technique allowed us to join precisely DNA fragments which 
could not be joined directly by treatment with ligase. These 
techniques greatly expand the spectrum of restriction en- 
donucleases which are useful in cloning experiments. An 
especially important feature of these techniques is the ability 
to regenerate endonuclease recognition sites at junctions in- 
volving filled-in ends. We demonstrate this explicitly in one case 
(the joining of a filled-in EcoRI end to a HacIII end), and 
available sequence data suggest that it will be a general feature 
of the method (see Table 3). 
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MATURE HUMAN FIBROBLAST 
INTERFERON 



CROSS-REFERENCE TO RELATED 
APPLICATION 

This is a continuation of application Ser. No. 889,722, 
filed Jul. 28, 1986, now abandoned which is a divisional of 
application Ser. No. 291,892, now abandoned filed Aug. 11, 
1981, which is a continuation-in-part of application Ser. No. 
190,799, filed Sep. 25, 1980, now abandoned. 

FIELD OF THE INVENTION 

This invention relates to the microbial production, via 
recombinant DNA technology, of human fibroblast inter- 
feron for use in the treatment of viral and neoplastic dis- 
eases, and to the means and end products of such production. 

BACKGROUND OF THE INVENTION 

The publications and other materials referred to herein to 
illuminate the background of the invention and, in particular 
cases, to provide additional detail respecting its practice are 
incorporated herein by reference and, for convenience, are 
numerically referenced in the following text and respec- 
tively grouped in the appended bibliography. 
Recombinant DNA Technology 

With the advent of recombinant DNA technology, the 
controlled microbial production of an enormous variety of 
useful polypeptides has become possible. Already in hand 
are bacteria modified by this technology to permit the 
production of such polypeptide products such as somatosta- 
tin, the (component) A and B chains of human insulin, 
human growth hormone. More recently, recombinant DNA 
techniques have been used to occasion the bacterial produc- 
tion of proinsulin, thymosin alpha 1, (an immune potenti- 
ating substance produced by the thymus) and leukocyte 
interferon. 

The workhorse of recombinant DNA technology is the 
plasmid, a non-chromosomal loop of double-stranded DNA 
found in bacteria and other microbes, oftentimes in multiple 
copies per cell. Included in the information encoded in the 
plasmid DNA is that required to reproduce the plasmid in 
daughter cells (i.e., a "replicon") and ordinarily, one or more 
selection characteristics such as, in the case of bacteria, 
resistance to antibiotics which permit clones of the host cell 
containing the plasmid of interest to be recognized and 
preferentially grown in selective media. The utility of plas- 
mids lies in the fact that they can be specifically cleaved by 
one or another restriction endonuclease or "restriction 
enzyme", each of which recognizes a different site on the 
plasmidic DNA. Thereafter heterologous genes or gene 
fragments may be inserted into the plasmid by endwise 
joining at the cleavage site or at reconstructed ends adjacent 
to the cleavage site. DNA recombination is performed 
outside the cell, but the resulting "recombinant" plasmid can 
be introduced into it by a process known as transformation 
and large quantities of the heterologous gene-containing 
recombinant plasmid obtained by growing the transformant 
Moreover, where the gene is properly inserted with refer- 
ence to portions of the plasmid which govern the transcrip- 
tion and translation of the encoded DNA message, the 
resulting expression vehicle can be used to actually produce 
the polypeptide sequence for which the inserted gene codes, 
a process referred to as expression. 
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Expression is initiated in a region known as the promoter 
which is recognized by and bound by RNA polymerase. In 
some cases, as in the tryptophan or "trp" promoter preferred 
in the practice of the present invention, promoter regions are 

5 overlapped by "operator" regions to form a combined pro- 
moter-operator. Operators are DNA sequences which are 
recognized by so-called repressor proteins which serve to 
regulate the frequency of transcription initiation at a par- 
ticular promoter. The polymerase travels along the DNA, 

10 transcribing the information contained in the coding strand 
from its 5' to 3' end into messenger RNA which is in turn 
translated into a polypeptide having the amino acid sequence 
for which the DNA codes. Each amino acid is encoded by a 
nucleotide triplet or "codon" within what may for present 

15 purposes be referred to as the "structural gene", i.e. that part 
which encodes the amino acid sequence of the expressed 
product. After binding to the promoter, the RNA polymerase 
first transcribes nucleotides encoding a ribosome binding 
site, then a translation initiation or "start" signal (ordinarily 

20 ATG, which in the resulting messenger RNA becomes 
AUG), then the nucleotide codons within the structural gene 
itself. So-called stop codons are transcribed at the end of the 
structural gene whereafter the polymerase may form an 
additional sequence of messenger RNA which, because of 

25 the presence of the stop signal, will remain untranslated by 
the ribosomes. Ribosomes bind to the binding site provided 
on the messenger RNA, in bacteria ordinarily as the mRNA 
is being formed, and themselves produce the encoded 
polypeptide, beginning at the translation start signal and 

30 ending at the previously mentioned stop signal. The desired 
product is produced if the sequences encoding the ribosome 
binding site are positioned properly with respect to the AUG 
initiator codon and if all remaining codons follow the 
initiator codon in phase. The resulting product may be 

35 obtained by lysing the host cell and recovering the product 
by appropriate purification from other bacterial protein. 
Fibroblast Interferon 

Human fibroblast interferon (KLF) is an antiviral protein 
which also exhibits a wide range of other biological activi- 

40 ties (see ref. 1 for review). It has reportedly been purified to 
homogeneity as a single polypeptide of 19,000-20,000 
molecular weight having a specific activity of 2 to 1 Ox 10 s 
units/mg (2,3). The sequence of the 13 NH 2 -terminal amino 
acids of FIF has been determined (4). Houghton et al. (5) 

45 have used synthetic deoxyoligonucleolides (predicted from 
this amino acid sequence) to determine the sequence of the 
276 5'-terminal nucleotides of FIF mRNA. Taniguchi et al. 
(6) and Derynck et al. (7) have recently employed RNA 
selection procedures to identify cloned cDNA copies of FIF 

50 mRNA in E. coli. See also Taniguchi et al., Gene 10, 11 
(1980) and Proc. Natl. Acad. Sci. (U.S.A.) 77, 5230 (1980) 
and Nature 285, 547 (1980). 

While isolation from donor fibroblasts has provided suf- 
ficient material for partial characterization and limited clini- 

55 cal studies with homogeneous fibroblast interferon, it is a 
totally inadequate source for the amounts of interferon 
heeded for large scale clinical trials and for broad scale 
prophylactic and/or therapeutic use thereafter. Indeed, pres- 
ently clinical investigations employing human fibroblast- 

60 derived interferon in antitumor and antiviral testing have 
principally been confined to crude (<1 percent pure) prepa- 
rations of the material, and long lead times for the manu- 
facture of sufficient quantities, even at unrealistic price 
levels, have critically delayed investigation on an expanded 

65 front. 

We perceived that application of recombinant DNA tech- 
nology would be the most effective way of providing large 
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quantities of fibroblast interferon which, despite the absence 
in material so produced of the glycosylation characteristic of 
human-derived material, could be employed clinically in the 
treatment of a wide range of viral and neoplastic diseases. 

More particularly, we proposed and have since succeeded 5 
in producing mature human fibroblast interferon microbi- 
ally, by constructing a gene therefor which could then be 
inserted in microbial expression vehicles and expressed 
under the control of microbial gene regulatory controls. 

Our approach to obtaining a fibroblast gene involved the 10 
following tasks: 

1. Partial amino acid sequences would be obtained by 
characterization of fibroblast interferon purified to 
essential homogeneity, and sets of synthetic DNA 
probes constructed whose codons would, in the aggre- 15 
gate, represent all the possible combinations capable of 
encoding the partial amino acid sequences. 

2. Bacterial colony banks would be prepared containing 
cDNA from induced messenger RNA. The probes of 
part (1) would be used to prime the synthesis of 20 
radio-labelled single stranded cDNA for use as hybrid- 
ization probes. The synthetic probes would hybridize 
with induced mRNA as template and be extended by 
reverse transcription to form induced, radio-labelled 
cDNA. Clones from the colony bank that hybridized to 25 
radio-labelled cDNA obtained in this manner would be 
investigated further to confirm the presence of a full- 
length interferon encoding gene. Any partial length 
putative gene fragment obtained would itself be used as 

a probe for the full-length gene. 30 

3. The full-length gene obtained above would be tailored, 
using synthetic DNA, to eliminate any leader sequence 
that might prevent microbial expression of the mature 
polypeptide and to permit appropriate positioning in an 35 
expression vehicle relative to start signals and the 
ribosome binding site of a microbial promoter. 
Expressed interferon would be purified to a point 
permitting confirmation of its character and determi- 
nation of its activity notwithstanding the absence of 4f) 
glycosylation. 

SUMMARY OF INVENTION 

A series of replicable plasmidic expression vehicles have 45 
been constructed which direct the high level synthesis in 
transformant microorganisms of a mature polypeptide with 
the properties of authentic human fibroblast interferon. The 
product polypeptide exhibits the amino acid sequence of 
such interferon and is active in in vitro testing despite the 50 
lack of glycosylation characteristic of the human-derived 
material. Reference herein to the expression of "mature 
fibroblast interferon," connotes the bacterial or other micro- 
bial production of an interferon molecule unaccompanied by 
associated glycosylation and the presequence that immedi- 55 
ately attends mRNA translation of the human fibroblast 
interferon genome. Mature fibroblast interferon, according 
to the present invention, is immediately expressed from a 
translation start signal (ATG) which also encodes the first 
amino acid codon of the natural product. The presence or 60 
absence of the methionine first amino acid in the microbially 
expressed product is governed by a kinetic phenomenon 
dependent on fermentation growth conditions and/or levels 
of expression in the transformant host. Mature fibroblast 
interferon could be expressed together with a conjugated 65 
protein other than the conventional leader, the conjugate 
being specifically cleavable in an intra- or extracellular 



m 
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environment. See British Patent Publication No. 2007676A. 
Finally, the mature interferon could be produced in conjunc- 
tion with a microbial "signal" peptide which transports the 
conjugate to the cell wall, where the signal is processed 
away and the mature polypeptide secreted. 

DESCRIPTION OF THE FIGURES 

FIG. 1 depicts the protein sequence information used to 
design degenerate dodecanucleotide primers. Below the 
protein sequence are corresponding mRNA sequences and 
six pools of complementary deoxyoligonucleotide primers. 

FIGS. 2A-C are autoradiographs of nitrocellulose-bound 
plasmid DNA prepared from some of the 600 bacterial 
transformanls having DNA from the fibroblast cDNA 
library. The plasmid DNA of the nitrocellulose membrane 
was hybridized with either probe A, B, or C. 

FIG. 3 presents DNA sequence of the cDNA insert of 
clone pFIF3 as determined by the Maxam-Gilbert proce- 
dure. The deduced protein sequence in printed above the 
DNA sequence. 

FIG. 4 is a diagram showing the approach used to remove 
the signal peptide coding regions from pFIF3. 

FIG. 5 is an autoradiograph of the polyacrylamide gel 
used to isolate the 141bp fragment lacking the signal peptide 
coding regions. 

FIG. 6, and 6a, 6b, 6c schematically depicts the construc- 
tion of plasmids coding for the direct expression of mature 
fibroblast interferon. Restriction sites and residues are as 
shown ("Pst I", etc.). "Ap R " and "Tc*" connote portions of 
the plasmid (s) which express, respectively, ampieillin and 
tetracycline resistance. The legend "p o" is an abbreviation 
for "promoter operator." 

DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

A. Microorganisms Employed 

The work described involved used of the microorganism: 
E. coli K-12 strain 294 (end A, thi~, hsr~, hsm^), as 
described in British Patent Publication No. 2055382 A. This 
strain has been deposited on Oct. 28, 1978 with the Ameri- 
can Type culture Collection, which is located at 12301 
Parklawn Drive. Rockville, Md. 20852, and given ATCC 
Acession No. 31446. All recombinant DNA work was 
performed in compliance with applicable guidelines of the 
National Institutes of Health. 

The invention, in its most preferred embodiments, is 
described with reference to E. coli, including not only strain 
E. coli K-12 strain 294, defined above, but also other known 
E. coli strains such as E. coli B, E. coli x 1776 and E. coli 
W 3110, or other microbial strains many of which are 
deposited and (potentially) available from recognized 
microorganism depository institutions, such as the American 
Type Culture Collection (ATCC)— cf. the ATCC catalogue 
listing. See also German Offenlegungschrift 2644432. These 
other microorganisms include, for example, Bacilli such as 
Bacillus subtilis and other enterobacteriaceae among which 
can be mentioned as examples Salmonella typhimurium and 
Serratia marcesans, utilizing plasmids that can replicate and 
express heterologous gene sequences therein. Yeast, such as 
Saccharomyces cerevisiae, may also be employed to advan- 
tage as host organism in the preparation of the interferon 
proteins hereof by expression of genes coding therefor under 
the control of a yeast promoter. 
MATERIALS AND METHODS 

General methods. 
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Restriction enzymes were purchased from New England 
Biolabs and used as directed. Plasmid DNA was prepared by 
a standard cleared lysate procedure (8) and purified by 
column chromatography on Biogel A-50M (Bio-Rad). DNA 
sequencing was performed using the method of Maxam and 5 
Gilbert (9). DNA restriction fragments were isolated from 
polyacrylamide gels by electroelution. DNA fragments were 
radiolabeled for use as hybridization probes by the random 
calf thymus DNA priming procedure of Taylor et al. (10). In 
situ colony hybridizations were performed by the Grunstein- 10 
Hogness procedure (11). 

Chemical synthesis of deoxyoligonucleotides. 

The deoxyoligonucleotides were synthesized by the 
modified phosphotriester method in solution (12), using 
trideoxynucleotides as building block (13). The material and 15 
general procedures were similar to those described (14). The 
six pools of primers (Fib 1-6) containing four dodecanucle- 
otides each were obtained by separately coupling two hex- 
amer pools (of two different 5'-terminal sequences each) 
with three different hexamer pools (of two different 3'-ter- 20 
minal sequences each). 

Induction of fibroblasts. 

Human fibroblasts (cell line GM-2504A) were grown as 
described previously (15). Growth medium (Eagles's mini- 
mal essential medium containing 10 percent fetal calf 25 
serum) was removed from roller bottles (Corning, 850 cm 2 ) 
and replaced with 50 ml growth medium containing 50 
ug/ml of poly (I):poly (C) (PL Biochemicals) and 10 ug/ml 
cycloheximide. This induction medium was removed after 4 
hours at 37° C. and cell monolayers were washed with PBS 30 
(0.14M NaCl, 3 mM KC1, 1.5 mM KH 2 P0 4 , 8 mM 
Na 2 HP0 4 ). Each bottle was incubated at 37° C. with 10 ml 
of a trypsin-EDTA solution (Gibco 610-5305) until cells 
were detached, and fetal calf serum was added to a concen- 
tration of 10 percent. Cells were spun for 15 minutes at 500x 35 
g and pellets were resuspended in PBS, pooled, and resedi- 
mented. Cells were frozen in liquid nitrogen. Approximately 
0.17 g of cells were obtained per roller bottle. 

Preparation and assay of interferon mRNA. 

Poly extraction and oligo(dT)-cellulose chromatography 40 
as described elsewhere (16). The poly (A) containing RNA 
was enriched for interferon mRNA by centrifugation on a 
linear 5 percent to 20 percent (w/v) sucrose gradient. The 
RNA samples were heated to 80° C. for 2 minutes, rapidly 
cooled, layered over the gradient, and centrifuged for 20 45 
hours at 30,000 rpm at 4° C. in a Beckman SW-40 rotor. 
Fractions were collected, ethanol precipitated, and dissolved 
in H 2 0. 

One microgram samples of mRNA were injected into 
Xenopus laevis oocytes as described previously (17,18 ). 50 
The injected oocytes were incubated 24 hours at 21° C, 
homogenized, and centrifuged for 5 minutes at 10,000x g. 
The interferon in the supernatant was determined by the 
cytopathic effect (CPE) inhibition assay (1) using Sindbis 
virus and human diploid (WISH) cells. Interferon titers of 55 
1,000 to 6,000 units recovered (NIH reference standard) per 
microgram of RNA injected were routinely obtained for the 
12S species of mRNA. 

Synthesis and cloning of cDNA. 

Single stranded cDNA was prepared in 100 pi reactions 60 
containing 5 ug of 12S fraction mRNA, 20 mM Tris-HCl 
(pH 8.3), 20 mM KC1, 8 mM MgCl 2 , 30 mM P-mercapto- 
ethanol, 100 uCi of (a 32 p)dCTP (Amersham) and 1 mM 
dATP, dCTP, dGTP, dTTP. The primer was the synthetic 
Hind III decamer dCCAAGCTTGG (19), which had been 65 
extended at the 3' terminus with about 20 to 30 deoxythy- 
midine residues using terminal deoxynucleotidyl transferase 



(20). 100 units of AMV reverse transcriptase were added and 
the reaction mixture was incubated at 42° C. for 30 minutes. 
The second strand DNA synthesis was carried out as 
described previously (21). The double stranded cDNA was 
treated with 1200 units of SI nuclease (Miles Laboratories) 
for 2 hours at 37° C. in 25 mM sodium acetate (pH 4.5), 1 
mM ZnCl 2 , 0.3M NaCl. After phenol extraction the mixture 
was separated electrophoretically on an 8 percent polyacry- 
lamide gel. cDNA (~0.5 ug) ranging from 500 to 1500 base 
pairs in size was recovered by electroelution. A 20 ng aliquot 
was extended with deoxyC residues using terminal deoxy- 
nucleotidyl transferase (20), and annealed with 100 ng of 
pBR322 which had been cleaved with Pst I and tailed with 
deoxyG residues (20). The annealed mixture was used to 
transform E. colt k-12 strain 294 (22) by a published 
procedure (23). Strain 294 was used throughout in the work 
described here, and has been deposited with the American 
Type Culture collection, accession no. 31446. 

Preparation of induced and uninduced 32 p-cDNA probes. 
5 ug of 12S mRNA were combined with either 2 ug of oligo 
(dT) 12 . lg (Collaborative Research) or 5 ug of each synthetic 
primer pool (Fib 1 to Fib 6) in 60 ul of 10 mM Tris-HCl (ph 
8), 1 mM EDTA. The mixtures were boiled 3 minutes, and 
quenched on ice. 60 ul of 40 mM Tris-HCl (pH 8.3), 40 mM 
KC1, 16 mM MgCl 2 ,60 mM fl-mercaptoethanol, 1 mM 
dATP, dGTP, dTTP and 5xlO"- 7 M (oe- 32 p) dCTP (Amersham, 
2,000-3,000 Ci/mmole) was added to each template-primer 
mix at 0° C. After the addition of 100 units of AMV reverse 
transcriptase, the reactions were incubated at 42° C. for 30 
minutes and purified by passage over 10 ml Sephadex G-50 
columns. The products were treated with 0.3N NaOH for 30 
minutes at 70° C, neutralized, and ethanol precipitated. 

The 32 p-cDNAs were combined with 100 ug of poly (A) 
mRNA from uninduced fibroblasts in 50 ul of 0.4M sodium 
phosphate (pH6.8), 0.1 percent SDS. The mixtures were 
heated at 98° C. for 5 minutes and allowed to anneal 15 
hours at 45° C. The DNA-RNA hybrids, (containing unin- 
duced cDNA sequences) were separated from single- 
stranded DNA (induced cDNA sequences) by chromatogra- 
phy on hydroxyapatite as described by Galau et al. (24). The 
DNA-RNA hybrids were treated with alkali to remove RNA. 

Screening of recombinant plasmids with 32 p-cDNA 
probes. 

Approximately 1 |ig samples of plasmid DNA were 
prepared from individual transformants by a published pro- 
cedure (25). The DNA samples were linearized by digestion 
with Eco RI, denatured in alkali, and applied to each of three 
nitrocellulose filters (Schleicher and Schuell, BA85) by the 
dot hybridization procedure (26). The filters were hybridized 
with the 32 p-cDNA probes for 16 hours at 42° C. in 50 
percent formamide, lOx Denhardt's solution (27), 6xSSC, 
40 mM Tris-HCl (pH 7.5), 2 mM EDTA, 40 ug/ml yeast 
RNA. Filters were washed with O.lxSSC, 0.1 percent SDS 
twice for 30' at 42° C, dried, and exposed to Kodak XR-2 
x-ray film using Dupont Lightning-Plus intensifying screens 
at -80° C. 

Construction of plasmids for direct expression of FIR 
The synthetic primers I (dATGAGCTACAAC) and H 
(dCATGAGCTACAAC) were phosphorylated using T4 
polynucleotide kinase and (y- 32 .P)ATP (Amersham) to a 
specific activity of 700 Ci/mmole as described previously 
(28). Primer repair reactions were performed as follows: 250 
pmoles of the 32 P-primers were combined with 8 ug (10 
pmole) of a 1200 bp Hha I restriction fragment containing 
the FTP cDNA sequence. The mixture was ethanol precipi- 
tated, resuspended in 50 ul H 2 0, boiled 3 minutes, quenched 
in a dry ice-ethanol bath, and combined with a 50 ul solution 
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of 20 mM Tris-HCl (pH 7.5), 14 mM MgCl 2 , 120 mM NaCl, 
0.5 mM dATP, dCTP, dGTP, dTTP at 0° C. 10 units of DNA 
polymerase I Klenow fragment (Boehringer-Mannheim) 
were added and the mixture was incubated at 37° C. for 4Vi 
hours. Following extraction with phenol/CHCl 3 and restric- 5 
tion with Pst I, the desired product was purified on a 6 
percent polyacrylamide gel. Subsequent ligations were done 
at room temperature (cohesive termini) or 4° C. (blunt ends) 
using previously detailed conditions (21,28). 

Assay for interferon expression in E. coli. to 
Bacterial extracts were prepared for IF assay as follows: 
One ml cultures were grown overnight in LB (29) containing 
5 ug/ml tetracycline, then diluted into 25 ml of M9 medium 
(29) containing 0.2 percent glucose, 0.5 percent casamino 
acids and 5 ug/ml tetracycline. 10 ml samples were har- 15 
vested by centrifugation when A 550 (Absorbance at 500 
nanometers) reached 1.0. The cell pellets were quickly 
frozen in a dry ice-ethanol bath and cleared lysates were 
prepared as described by Clewell (8). Interferon activity in 
the supernatants was determined by comparison with NIH 20 
FTF standards using cytopathic effect (CPE) inhibition 
assays as reviewed previously (1). Two different assays were 
used: (a) WISH (human amnion) cells were seeded in 
microtiter dishes. Samples were added 16 to 20 hours later 
and diluted by serial 2-fold dilution. Sindbts virus was added 25 
after at least 3 hours in incubation. Plates were stained 20 to 
24 hours later with crystal violet, (b) MDBK (bovine kid- 
ney) cell line was seeded simultaneously with 2-fold dilu- 
tions of samples. Vesicular stomatitis virus was added after 
2 to 3 hours incubation and plates were stained with crystal 30 
violet 16 to 18 hours later. To test pH 2 stability bacterial 
extracts and standards were diluted in minimal essential 
medium to a concentration of 1 000 units/ml. One ml aliquots 
were adjusted to pH 2 with IN HC1, incubated at 4° C. for 
16 hours, and neutralized by addition of NaOH. IF activity 35 
was determined by the CPE inhibition assay using human 
amnion cells. To establish antigenic identity 25 ul aliquots of 
the 1000 U/ml interferon samples (untreated) were incu- 
bated with 25 ul of rabbit antihuman leukocyte interferon for 
60' at 37° C, centrifuged at 12,000x g for 5 minutes and the 40 
supernatant assayed. Fibroblast and leukocyte interferon 
standards were obtained from the National Institutes of 
Health. Rabbit antihuman leukocyte interferon was obtained 
from the National Institute of Allergy and Infectious Dis- 
eases. 45 
RESULTS 

Chemical synthesis of primer pools complementary of 
FIF mRNA. 

The amino-terminal protein sequence of human fibroblast 
interferon (4) permitted us to deduce the 24 possible mRNA 50 
sequences which could code for the first four amino acids. 
The 24 complementary deoxyoligonucleotides were synthe- 
sized in 6 pools of 4 dodecamers each (FIG. 1). 

The six pools of 4 deoxyoligonucleotides each were 
synthesized by a modified phosphotriester method that has 55 
been used previously for the rapid synthesis of oligonucle- 
otides in solution (12) and on solid phase (14). The basic 
strategy involved reacting two different 3'-blocked trimers 
with an excess of a single 5'-protected trimer to yield a pool 
of two hexamers, each represented equally. The coupling of 60 
two pools, each containing two hexamers, then resulted in a 
pool of four dodecamers. 

Identification of FIF cDNA clones. 

Using 12S mRNA from induced human fibroblasts (1 ,000 
units IF activity per ug in oocyte assay), double stranded 65 
cDNA was prepared and inserted into pBR322 at the Pst I 
site by the standard dG:dC tailing method (20). A fibroblast 
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cDNA library consisting of 30,000 ampicillin-sensitive, let- 
racycline-resistant transformants of E. coli K-12 strain 294 
was obtained from 20 ng of cDNA ranging in size from 550 
to 1300 base pairs. Plasmid DNA was prepared from 600 of 
the transformants and applied to 3 sets of nitrocellulose 
filters as described in Materials and Methods. 

The approach followed in the identification of hybrid 
plasmids containing fibroblast interferon cDNA sequences 
was similar to that used to identify human leukocyte inter- 
feron recombinant plasmids (30). Radiolabeled cDNA 
hybridization probes were prepared using either the 24 
synthetic dodecamers or oligo(dT)i 2 _i S as primers and 12S 
RNA from induced fibroblasts (5000 units/ug in oocytes) as 
template. The 32 P-cDNAs (specific activity >5xl0 s cpm/ug) 
obtained were hybridized to a large excess of mRNA iso- 
lated from uninduced human fibroblasts, and the rnRNA- 
cDNA hybrids were separated from unreacted cDNA by 
hydroxyapatite chromatography (24). The single stranded 
cDNA fractions should be enriched for sequences which are 
present in induced fibroblasts but absent in uninduced cells, 
and the mRNA-cDNA hybrids should represent sequences 
common to both induced and uninduced cells. Approxi- 
mately 4x10° cpm of single stranded cDNA (hybridization 
probe A) and 8xl0 6 cpm of cDNA-mRNA hybrids were 
obtained using oligo(dT) 12 _ l8 primed cDNA; 1.5xl0 6 cpm 
of single stranded (hybridization probe B) and 1.5xl0 5 cpm 
of hybrids were obtained from cDNA primed using synthetic 
dodecamer pools Fib 1-6. The cDNA-mRNA hybrids from 
both fractionations were combined, the RNA hydrolyzed by 
treatment with alkali, and the 32 P-cDNA used as hybridiza- 
tion probe C. Many of the 600 plasmid samples hybridized 
with both probes A and C, indicating that the hybridization 
reactions between uninduced mRNA and 32 P-cDNA (prior 
to the hydroxyapatite fractionation step) had not gone to 
completion. However, only one of the 600 plasmids (pF526) 
hybridized strongly with the specifically primed, induced 
cDNA probe B (FIG. 2). Plasmid pF526 also hybridized 
with the total oligo(dT) 12 . 18 primed, induced cDNA probe 
A, and failed to give detectable hybridization to the com- 
bined uninduced probe C. 

Pst 1 digestion of pF526 showed the cloned cDNA insert 
to be about 550 base pairs long, probably too short to contain 
the entire coding region for a protein the size of fibroblast 
interferon. Therefore, a 32 P-labeled DNA probe was pre- 
pared from this Pst I fragment by random priming with calf 
thymus DNA (10). This probe was used to screen 2000 
individual colonies from a newly constructed fibroblast 
cDNA library (the new cDNA library was prepared using 
12S mRNA from induced fibroblasts having a titer of 6,000 
units/ml in the oocyte assay system). Sixteen clones hybrid- 
ized to the probe. Plasmids prepared from the majority of 
these released two fragments when cleaved with Pst 1, 
indicating that the cDNA contained an internal Pst I site. 
Clone pFIF3 contained the largest cDNA insert, about 800 
base pairs. The DNA sequence of the insert was determined 
by the Maxam-Gilbert procedure (9) and is shown in FIG. 3. 
The amino acid sequence of human fibroblast interferon 
predicted from the nucleotide sequence is identical to that 
reported recently by Taniguchi et al. (31) and by Derynck el 
al. (7) from DNA sequencing of FIF cDNA clones. A 
precursor or signal peptide of 21 amino acids is followed by 
a mature interferon polypeptide of 166 amino acids, a stretch 
of 196 3'-untranslated nucleotides and a poly(A) tail. The 
NH 2 -terminal 20 amino acids of mature FIF have been 
directly determined by protein microsequencing and arc the 
same as those predicted from the DNA sequence. The 
calculated formula molecular weight of mature human fibro- 
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blast interferon having the 166 amino acids shown in FIG. 
3 is about 20,027. 

Direct expression of fibroblast interferon. 

To express high levels of mature fibroblast interferon in E. 
coli, initiation of protein synthesis must occur at the ATG 5 
codon of the mature polypeptide (amino acid 1) rather than 
at the ATG of the signal peptide (amino acid SI) (FIG. 3). 

Our approach to removing the signal peptide coding 
regions from pFIF3 is depicted in FIG. 4. A 1200 bp DNA 
fragment which contained the entire cDNA insert was iso- 10 
lated from a polyacrylamide gel after digesting pFIF3 with 
Hha I. Two separate synthetic deoxyoligonucleotide prim- 
ers, dATGAGCTACAAC(I) and dCATGAGCTACAAC(II), 
were prepared. Both primers contain the coding sequence for 
the first four amino acids of mature fibroblast interferon; 15 
primer II has an additional C at the 5'-terminus. Primer 
repair reactions and subsequent ligations were carried out 
separately for primers I and n, and gave nearly identical 
results. Therefore, only reactions using primer I are dis- 
cussed in detail here. The primers were 5'-radiolabeled using 20 
(7- 32 P)ATP and T4 polynucleotide kinase, combined with 
the 1200 bp Hha I DNA fragment and the mixture denatured 
by boiling. Following hybridization of the primer to the 
denatured Hha I DNA fragment, E. coli DNA polymerase I 
Klenow fragment (33) was used to catalyze the repair 25 
synthesis of the plus (top) strand (FIG. 4). In addition, the 
associated 3' §5' exonuclease activity of the Klenow frag- 
ment removed the 3 '-protruding end from the minus (bot- 
tom) strand, leaving a flush end. Analysis of samples of the 
reaction mixture by polyacrylamide gel electrophoresis indi- 30 
cated that the repair synthesis did not go to completion, but 
stopped at several discrete sites. Therefore, the entire reac- 
tion mixture was treated with Pst I and the desired 141 bp 
fragment (180,000 Cerenkov cpm; ~0.3 pmole) was purified 
by polyacrylamide gel electrophoresis (FIG. 5). Ligation of 35 
this fragment to 1 ug (~4 pmole) of the 363 bp Pst I-Bgl n 
fragment isolated from pFTF3 (FIG. 4), followed by Bgl II 
digestion, yielded 50,000 Cerenkov cpm (~0.1 pmole, ~30 
ng) of the 504 bp DNA fragment containing the entire 
coding sequence for mature fibroblast interferon. The same 40 
reactions using primer II gave 83,000 cpm ("0.15 pmole, ~50 
ng) of 505 bp product. 

The construction of plasmids which direct the synthesis of 
human fibroblast interferon is outlined in FIG. 6. Separate 
expression plasmids were constructed which placed FIF 45 
synthesis under the control of the E. coli lac or trp promoter- 
operator systems. Both of these systems have proven useful 
for the direct expression of eukaryotic genes in E. coli: 
human growth hormone has been efficiently synthesized 
using the lac system (21) and human leukocyte interferon 50 
has been produced at high levels using the trp system (30) 
and Nature 287, 411 (1980). 

pBRH trp was digested with EcoRI restriction enzyme 
and the resulting fragment isolated by PAGE and electro- 
elution. EcoRI-digested plasmid pSom 11 (K. Itakura et al., 55 
Science 198, 1056 (1977); G.B. patent publication no. 2 007 
676 A) was combined with the above fragment. The mixture 
was ligated with T 4 DNA ligase as previously described and 
the resulting DNA transformed into E. coli K-12 strain 294 
as previously described. Transformant bacteria were 60 
selected on ampicillin-containing plates. Resulting ampicil- 
lin-resistant colonies were screened by colony hybridization 
(M. Gruenstein et al., Proc Nat'l Acad Sci USA 72, 
3951-3965 [1975]) using as a probe the trp promoter- 
operator-containing the above fragment isolated from 65 
pBRHtrp, which had been radioactively labelled with p 32 . 
Several colonies shown positive by colony hybridization 



were selected, plasmid DNA was isolated and the orientation 
of the inserted fragments determined by restriction analysis 
employing restriction enzymes Bgin and BamHI in double 
digestion. E. coli 294 containing the plasmid designated 
pSOM7A2, which has the trp promoter-operator fragment in 
the desired orientation was grown in LB medium containing 
10 ug/ml ampicillin. The cells were grown to optical density 
1 (at 550 nM), collected by centrifugation and resuspended 
in M9 media in tenfold dilution. Cells were grown for 2-3 
hours, again to optical density 1, then lysed and total cellular 
protein analyzed by SDS (sodium dodccyl sulfate) urea (15 
percent) polyacrylamide gel electrophoresis (J. V. Maizel Jr. 
et al., Meth Viral 5, 180-246 [1971]). 

Plasmid pBR.322 was Hind HI digested and the protruding 
Hind HJ ends in turn digested with SI nuclease. The SI 
nuclease digestion involved treatment of 10 ug of Hind 
Dl-cleaved pBR322 in 30 ul SI buffer (0.3M NaCl, 1 mM 
ZnCl 2 , 25 mM sodium acetate, pH 4.5) with 300 units SI 
nuclease for 30 minutes at 15° C. The reaction was stopped 
by the addition of 1 ul of 30xSl nuclease stop solution 
(0.8M tris base, 50 mM EDTA). The mixture was phenol 
extracted, chloroform extracted and ethanol precipitated, 
then EcoRI digested as previously described and the large 
fragment (1) obtained by PAGE procedure followed by 
electroelution. The fragment obtained has a first EcoRI 
sticky end and a second, blunt end whose coding strand 
begins with the nucleotide thymidine. 

Plasmid pSom7 A2, as prepared above, was Bgl II 
digested and the Bgl II sticky ends resulting made double 
stranded with the Klenow polymerase I procedure using all 
four deoxynucleotide triphosphates. EcoRI cleavage of the 
resulting product followed by PAGE and electroelution of 
the small fragment (2) yielded a linear piece of DNA 
containing the tryptophan promoter-operator and codons of 
the LE' "Proximal" sequence upstream from the Bgl II site 
("LE'(p)"). The product had an EcoRI end and a blunt end 
resulting from filling in the Bgl II site. However, the Bgl II 
site is reconstituted by ligation of the blunt end of the above 
fragment (2) to the blunt end of the above prepared fragment 
(1). Thus, the two fragments were ligated in the presence of 
T 4 DNA ligase to form the recirculated plasmid pHKY 10 
which was propagated by transformation into competent E. 
coli strain 294 cells. 

Plasmid pGMl carries the E. coli tryptophan operon 
containing the deletion ALE1413 (G. F. Miozzari, et al., 
(1978) J. Bacteriology 133, 1457-1466)) and hence 
expresses a fusion protein comprising the first 6 amino acids 
of the trp leader and approximately the last third of the trp 
E polypeptide (hereinafter referred to in conjunction as LE'), 
as well as the trp D polypeptide in its entirety, all under the 
control of the trp promoter-operator system. The plasmid, 20 
ug, was digested with the restriction enzyme PvuIT which 
cleaves the plasmid at five sites. The gene fragments were 
next combined with EcoRI linkers (consisting of a self 
complementary oligonucleotide of the sequence: pCAT- 
GAATTCATG) providing an EcoRI cleavage site for a later 
cloning into a plasmid containing an EcoRI site. The 20 ug 
of DNA fragments obtained from pGMl were treated with 
10 units T 4 DNA ligase in the presence of 200 pico moles of 
the 5'-phosphorylated synthetic oligonucleotide pCAT- 
GAATCATG and in 20 ul T 4 DNA ligase buffer (20 mM tris, 
pH 7.6, 0.5 mM ATP, 10 mM MgCl 2 , 5 mM dithiothrcitol) 
at 4° C. overnight. The solution was then heated 10 minutes 
at 70° C. to halt ligation. The linkers were cleaved by EcoRI 
digestion and the fragments, now with EcoRI ends were 
separated using 5 percent polyacrylamide gel electrophore- 
sis (hereinafter "PAGE") and the three largest fragments 
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isolated from the gel by first staining with ethidium bromide, 
locating the fragments with ultraviolet light, and cutting 
from the gel the portions of interest. Each gel fragment, with 
300 microliters O.lxTBE, was placed in a dialysis bag and 
subjected to electrophoresis at 100 v for one hour in 0.1 x 5 
TBE buffer (TBE buffer contains: 10.8 gm tris base, 5.5 gm 
boric acid, 0.09 gm Na 2 EDTA in 1 liter H 2 0). The aqueous 
solution was collected from the dialysis bag, phenol 
extracted, chloroform extracted and made 0.2M sodium 
chloride, and the DNA recovered in water after ethanol 10 
precipitation. The tip promotcr-opcrator-containing gene 
with EcoRI sticky ends was identified in the procedure next 
described, which entails the insertion of fragments into a 
tetracycline sensitive plasmid which, upon promoter-opera- 
tor insertion, becomes tetracycline resistant. 15 

Plasmid pBRHl (R. I. Rodriguez, et al. Nucleic Acids 
Research 6, 3267-3287 [1979]) expresses ampicilin resis- 
tance and contains the gene for tetracycline resistance but, 
there being no associated promoter, does not express that 
resistance. The plasmid is accordingly tetracycline sensitive. 20 
By introducing a promoter-operator system in the EcoRI 
site, the plasmid can be made tetracycline resistant. 

pBRHl was digested with EcoRI and the enzyme 
removed by phenol extraction followed by chloroform 
extraction and recovered in water after ethanol precipitation. 25 
The resulting DNA molecule was, in separate reaction 
mixtures, combined with each of the three DNA fragments 
obtained above and ligated with T 4 DNA ligase as previ- 
ously described. The DNA present in the reaction mixture 
was used to transform competent E. coli K-12 strain 294, K. 30 
Backman et al., Proc Nat'l Acad Sci USA 73, 4174-4198 
[1976]) by standard techniques (V. Hershfield et al., Proc 
Nat' 1 Acad Sci US A 7 1 , 3455-3459 [ 1 974]) and the bacteria 
plated on LB plates containing 20 |ig/ml ampicillin and 5 
ug/ml tetracycline. Several tectracycline-resistant colonies 35 
were selected, plasmid DNA isolated and the presence of the 
desired fragment confirmed by restriction enzyme analysis. 
The resulting plasmid is designated pBRHtrp. 

An EcoRI and BamHI digestion product of the viral 
genome of hepatitis B was obtained by conventional means 40 
and cloned into the EcoRI and BamHI sites of plasmid 
pGII6 (D. V. Goeddel et al., Nature 281, 544 [1979])) to 
form the plasmid pHS32. Plasmid pHS32 was cleaved with 
Xbal, phenol extracted, chloroform extracted and ethanol 
precipitated. It was then treated with 1 ul E. coli polymerase 45 
I, Klenow fragment (Boehringer-Mannheim) in 30 ul poly- 
merase buffer (50 mM potassium phosphate pH 7.4, 7 mM 
MgCl 2 , 1 mM p-mercaptoethanol) containing 0. 1 mM dTTP 
and 0.1 mM dCTP for 30 minutes at 0° C. then 2 hr. at 37° 
C. This treatment causes 2 of the 4 nucleotides complemen- 50 
tary to the 5' protruding end of the Xbal cleavage site to be 
filled in: t,0170 

Two nucleotides, dC and dT, were incorporated giving an 
end with two 5' protruding nucleotides. This linear residue 
of plasmid pHS32 (after phenol and chloroform extraction 55 
and recovery in water after ethanol precipitation) was 
cleaved with EcoRI. The large plasmid fragment was sepa- 
rated from the smaller EcoRI-Xbal fragment by PAGE and 
isolated after electroelution. This DNA fragment from 
pHS32 (0.2 ug), was ligated, under conditions similar to 60 
those described above, to the EcoRI-Taq I fragment of the 
tryptophan operon (~0.01 ug), derived from pBRHtrp. 

In the process of ligating the fragment from pHS32 to the 
Eco RI-Taq I fragment,.. as described above, the Taq I 
protruding end is ligated to the Xbal remaining protruding 65 
end even though it is not completely Watson-Crick base- 
paired: t,0180 
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A portion of this ligation reaction mixture was trans- 
formed into E. coli 294 cells, heat treated and plated on LB 
plates containing ampicillin. Twenty-four colonies were 
selected, grown in 3 ml LB media, and plasmid isolated. Six 
of these were found to have Xbal site regenerated via E. coli 
catalyzed DNA repair and replication: t,0181 

These plasmids were also found to cleave both with 
EcoRI and Hpal and to give the expected restriction frag- 
ments. One plasmid, designated pTrp 14, was used for 
expression of heterologous polypeptides, as next discussed. 

The plasmid pHGH 107 (D. V. Goeddel et al, Nature, 281 , 
544, 1979) contains a gene for human growth hormone made 
up of 23 amino acid codons produced from synthetic DNA 
fragments and 163 amino acid codons obtained from 
complementary DNA produced via reverse transcription of 
human growth hormone messenger RNA. This gene, though 
it lacks the codons of the "pre" sequence of human growth 
hormone, does contain an ATG translation initiation codon. 
The gene was isolated from 10 ug pHGH 107 after treatment 
with EcoRI followed by E. coli polymerase I Klenow 
fragment and dTTP and dATP as described above. Following 
phenol and chloroform extraction and ethanol precipitation 
the plasmid was treated with BamHI. 

The human growth hormone ("HGH") gene-containing 
fragment was isolated by PAGE followed by electroelution. 
The resulting DNA fragment also contains the first 350 
nucleotides of the tetracycline resistance structural gene, but 
lacks the tetracycline promoter-operator system so that, 
when subsequently cloned into an expression plasmid, plas- 
mids containing the insert can be located by the restoration 
of tetracycline resistance. Because the EcoRI end of the 
fragment has been filled in by the Klenow polymerase 1 
procedure, the fragment has one blunt and one sticky end, 
ensuring proper orientation when later inserted into an 
expression plasmid. 

The expression plasmid pTrpl4 was next prepared to 
receive the HGH gene-containing fragment prepared above. 
Thus, pTrpl4 was Xbal digest and the resulting sticky ends 
filled in with the Klenow polymerase I procedure employing 
dATP, dTTP, dGTP and dCTP. After phenol and chloroform 
extraction and ethanol precipitation the resulting DNA was 
treated with BamHI and the resulting large plasmid fragment 
isolated by PAGE and electroelution. The pTrpl4-deriveQ 
fragment had one blunt and one sticky end, permitting 
recombination in proper orientation with the HGH gene 
containing fragment previously described. 

The HGH gene fragment and the pTrpl4 AXba-BamHI 
fragment were combined and ligated together under condi- 
tions similar to those described above. The filled in Xbal and 
EcoRI ends ligated together by blunt end ligation to recreate 
both the Xbal and the EcoRI site: t,0190 

This construction also recreates the tetracycline resistance 
gene. Since the plasmid pHGH 107 expresses tetracycline 
resistance from a promoter lying upstream from the HGH 
gene (the lac promoter), this construction, designated pHGH 
207, permits expression of the gene for tetracycline resis- 
tance under the control of the tryptophan promoter-operator. 
Thus the ligation mixture was transformed into E. coli 294 
and colonies selected on LB plates containing 5 ug/ml 
tetracycline. 

Plasmid pHGH 207 was EcoRI digested and the trp 
promoter containing EcoRI fragment recovered by PAGE 
followed by electroelution. Plasmid pBRHl was EcoRI 
digested and the cleaved ends treated with bacterial alkaline 
phosphatase ("BAP") (1 ug, in 50 mM tris pH 8 and 1 0 mM 
MgCl 2 for 30 min. at 65° C.) to remove the phosphate 
groups on the protruding EcoRI ends. Excess bacterial 
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alkaline phosphatase was removed by phenol extraction, 
chloroform extraction and ethanol precipitation. The result- 
ing linear DNA, because it lacks phosphates on the protrud- 
ing ends thereof, will in ligation accept only inserts whose 
complementary stick ends are phosphorylated but will not 5 
itself recircularize, permitting more facile screening for 
plasmids containing the inserts. 

The EcoRI fragment derived from pHGH 207 and the 
linear DNA obtained from pBRHl were combined in the 
presence of T 4 ligase as previously described and ligated. A 
portion of the resulting mixture was transformed into E. coli 
strain 294 as previously described, plated on LB media 
containing 5 ug/ml of tetracycline, and 12 tetracycline 
resistant colonies selected. Plasmid was isolated from each 
colony and examined for the presence of a DNA insert by 
restriction endonuclease analysis employing EcoRI and 15 
Xbal. One plasmid containing the insert was designated 
pHKYl. 

The plasmid pHKYlO, described above, is a derivative of 
pBR322 which contains a Bgl II site between the tetracy- 
cline resistance (Tc*) promoter and structural gene (32). The 20 
large DNA fragment isolated after digesting pHKYlO with 
Pst I and Bgl II therefore contains part of the ampicillin 
resistance (Ap*) gene and all of the Tc* structural gene, but 
lacks the Tc* promoter (FIG. 6). The plasmid pGH6 (21) 
was digested with Eco RI, the resulting single stranded ends 25 
were filled in with DNA polymerase I, and the plasmid was 
cleaved with Pst I. The small fragment, containing part of 
the Ap s gene, a double lac promoter and lac ribosome 
binding site, but lacking an ATG initiation triplet was 
isolated. A similar tip promoter fragment, containing the trp 30 
leader ribosome binding site, but lacking an ATG sequence 
(30), may be isolated from pHKYl described above; see 
(32) (see FIG. 6). 

The trp fragment just referred to is an analog of the E. coli 
tryptophan operon from which the so-called trp attenuator 35 
has been deleted, See J. Bact. 133, 1457 (1978), to control- 
lably heighten expression levels. Expression plasmids con- 
taining the modified trp regulon can be grown to predeter- 
mined levels in nutrient media containing additive 
tryptophan in quantities sufficient to repress the promoter- 40 
operator system, then be deprived of tryptophan so as to 
derepress the system and occasion the expression of the 
intended product. 

The expression plasmids may be assembled via three part 
ligation reactions as shown in FIG. 6. 15 ng ("0.05 pmole) 45 
of the assembled FIF gene (504 or 505 bp), 0.5 ug ("0.2 
pmole) of the large Pst I-Bgl II fragment of pHKYlO and 0.2 
ug ("0.3 pmole) of the appropriate promoter fragment were 
ligated and the mixture used to transform E. coli 294 (22). 
Plasmid DNA was prepared from individual transformants 50 
and analyzed by restriction mapping. Correct joining of the 
assembled gene to the promoter fragment should restore the 
Eco RI (lac) or the Xba I (trp) recognition sequences. The 
majority of the plasmids gave the expected restriction 
enzyme digestion patterns. Individual clones (12 containing 55 
the trp promoter and 12 containing the lac promoter) were 
grown and extracts prepared for interferon assay as 
described in Materials and Methods. 

When assayed on human amnion (WISH) cells for anti- 
viral activity by the CPE inhibition assay (1) five of the trp 60 
transformants were positive (each approximately equiva- 
lent); eleven of the lac transformants gave equivalent IF 
activities. Therefore, one transformant from each series 
(pFIFlac9 and pFIFtrp69) was selected for further study 
(Table 1). DNA sequence analysis demonstrated that the 65 
desired attachment of promoter to FIF structural gene had 
occurred in both cases. t,0210 



The amounts of fibroblast interferon produced by 
pFIFlac9 and pFTFtrp69 are shown in Table 1. The trp 
promoter gave a FIF expression level measurably higher 
than did the lac promoter. In an attempt to further increase 
FIF expression levels, pFIFtrp69 was cleaved with Eco RI 
and two 300 base pair Eco RI fragments containing the trp 
promoter (30) were inserted. The resulting plasmid, 
pFIFtrp 3 69, contains three successive trp promoters which 
read toward the FIF gene. The amount of FIF synthesized by 
E. coli K-12 strain 294/pFIF trp 3 69 is 4-5 times that 
produced by pFIF trp 69 (Table 1). This is apparently due to 
the derepression of the trp promoter which occurs when trp 
repressor levels are titrated by the multiple copies of the trp 
operator. 

The FIF produced by E. coli K-12 strain 294/pFIFtrp69 
behaves like authentic human FTF. As shown in Table 2, its 
antiviral activity is about 30 times greater on human cells 
than bovine cells. In addition, the bacterially produced FTF 
is stable to treatment at pH 2 overnight and is not neutralized 
by rabbit antihuman leukocyte interferon antibodies (Table 
3). t,0220 t,0221 
Purification 

The purification procedure for bacterial derived fibroblast 
is as follows: 

1. Frozen cells are suspended in twelve times volume per 
weight with sucrose lysis buffer (100 mM Tris-HCl, 10 
percent sucrose, 0.2M NaCl, 50 mM EDTA, 0.2 mM 
PMSF, pH 7.9) containing lysozyme at 1 mg ml -1 . The 
cell suspension is stirred for 1 hour at 4° C. and centri- 
fuged. Fibroblast interferon activity remains in the super- 
natant. 

2. Polyethyleneimine (5 percent v/v) is added to the soni- 
cated supernatant to a final concentration of 0.5 percent 
(v/v). The solution is stirred for 1 hour at 4° C. and 
centrifuged. Interferon activity remains in the superna- 
tant. 

3. Solid ammonium sulfate is added to the polyethylene- 
imine supernatant to a final concentration of 50 percent 
saturation, stirred for 30 minutes at 4° C. and centrifuged. 
Interferon activity is in the 50 percent pellet. 

4. The 50 percent ammonium sulfate pellet is suspended in 
one half the volume of the 50 percent ammonium sulfate 
suspension with Phosphate Buffered Saline (20 mM 
sodium phosphate 0.15M NaCl, pH 7.4). Polyethylene 
glycol 6000 (50 percent w/v in PBS) is added to a final 
concentration of I2V2 percent (v/v), stirred at 4° C. for 2 
hours and centrifuged. Interferon activity is in the pellet. 
The pellet is suspended in a minimal volume of sucrose 
lysis buffer and clarified by centrifugation. 

This initial extraction procedure results in a purification of 
fibroblast interferon from 0.001 percent of the total protein 
to 0.05 percent of the total protein. This material can be 
further purified to homogeneity by the following column 
chromatography steps: 

5. Afinity chromatography on Amicon Blue B in sucrose 
lysis buffer. 

6. Anion exchange chromatography on QAE Sephadex in 
sucrose lysis buffer in the absence of 0.2M NaCl. 

7. Size exclusion chromatography on Sephadex G-75 in 
sucrose lysis buffer. 

8. Reverse phase high pressure liquid chromatography. 
Parenteral Administration 

FIF may be parenterally administered to subjects requir- 
ing antitumor or antiviral treatment. Dosage and dose rate 
may parallel that currently in use in clinical investigations of 
human derived materials, e.g., about (1-I0)xl0 6 units daily, 
and in the case of materials of purity greater than 1 percent- 
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age, likely up to, e.g., 150xl0 6 units daily. Dosages of 
bacterially obtained FIF could be significantly elevated for 
greater effect owning to the essential absence of human 
proteins other than FIF, which proteins in fibroblast-derived 
materials may act as pyrogens, exhibiting adverse effects, 5 
e.g., malaise, temperature elevation, etc. 

As one example of an appropriate dosage form for essen- 
tially homogeneous bacterial FIF in parenteral form, 3 mg. 
FIF of specific activity of, say, 2x10 s u/mg may be dissolved 
in 25 ml. 5 percentage serum albumin (human) - USP, the 10 
solution passed through a bacteriological filter and the 
filtered solution aspetically subdivided into 100 vials, each 
containing 6xl0 6 units pure interferon suitable for parenteral 
administration. The vials are preferably stored in the cold 
(-20° C.) prior to use. 15 

The compounds of the present invention can be formu- 
lated according to known methods to prepare pharmaceuti- 
cally useful compositions, whereby the polypeptide hereof is 
combined in admixture with a pharmaceutically acceptable 
carrier vehicle. Suitable vehicles and their formulation are 20 
described in Remington' s Pharmaceutical Sciences by E. W. 
Martin, which is hereby incorporated by reference. Such 
compositions will contain an effective amount of the inter- 
feron protein hereof together with a suitable amount of 
vehicle in order to prepare pharmaceutically acceptable 25 
compositions suitable for effective administration to the 
host. One preferred mode of administration is parenteral. 
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We claim: 

I. A composition comprising water and a nonglycosylated 
polypeptide having the amino acid sequence of a mature 
human fibroblast interferon, said nonglycosylated polypep- 
tide having a total of 165 or 166 amino acids and said 
composition being free of any glycosylated human fibroblast 
interferon. 

2. The composition of claim 1, said nonglycosylated 
polypeptide having the amino acid sequence 

X-Ser-Tyr-Asn-Leu-Leu-Gly-Phe-Leu-Gln-Arg-Ser-Ser- 
Asn-Phe-Gln-Cys-Gln-Lys-Leu-Leu-Trp-Gln-Leu- 
Asn-Gly-Arg-Leu-Glu-Tyr-Cys-Leu-Lys-Asp-Arg- 
Met-Asn-Phe-Asp-Ile-Pro-Glu-Glu-Ile-Lys-Gln-Lcu- 
Gln-Gln-Phe-Gln-Lys-Glu-Asp-Ala-Ala-Leu-Thr-Ilc- 
Tyr-Glu-Met-Leu-Gln-Asn-Ile-Phe-Ala-Ile-Phc-Arg- 
Gln-Asp-Ser-Ser-Ser-Thr-Gly-Trp-Asn-Glu-Thr-Ile- 
Val-Glu-Asn-Leu-Leu-Ala-Asn-Val-Tyr-His-Gln-Ilc- 
Asn-His-Leu-Lys-Thr-Val-Leu-Glu-Glu-Lys-Leu-Glu- 
Lys-Glu-Asp-Phe-Thr-Arg-Gly-Lys-Leu-Met-Ser-Scr- 
Leu-His-Leu-Lys-Arg-Tyr-Tyr-Gly-Arg-Ile-Leu-His- 
Tyr-Leu-Lys-Ala-Lys-Glu-Tyr-Ser-His-Cys-Ala-Trp- 
Thr-Ile-Val-Arg-Val-Glu-Ile-Leu-Arg-Asn-Phe-Tyr- 
Phe-Ile-Asn-Arg-Leu-Thr-GIy-Tyr-Leu-Arg-Asn, 
wherein X is H or Met. 

3. The composition of claim 2, said nonglycosylated 
polypeptide having a formula molecular weight of about 
20,027. 
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4. The composition of claim 1, 2 or 3, said composition 
being free of human proteins. 

5. The composition of claim 1, 2 or 3, said composition 
containing a therapeutically effective amount of said nong- 
lycosylated polypeptide and being suitable for parenteral 
administration. 
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6. The composition of claim 4, said composition contain- 
ing a therapeutically effective amount of said nonglycosy- 
lated polypeptide and being suitable for parenteral admin- 
istration. 
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Table 1. 



Interferon activity in extracts of E. col i 



E. coli K-12 
strain 294 
transformed by: 



Cell density 
(cells/ml) 



IF Activity 
(units/1 culture) 



FIF molecules 
per eel 1 



pBR322 

pFIF1ac9 

pFIFtrp69 



3.5 x 10 8 
3.5 x 108 
3.5 x 10 8 



pFIFtr£ 3 69 3.5 x 108 



9.0 x 10 5 
1.8 x 10 7 

8.1 x 10 7 



2,250 
4,500 
20,200 
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Table 2. Interferon activities measured on different cell types 





Interfero 


n Activity 


(uni ts/ml ) 


Cells 


LeIF 


FIF 


E. coli K-12 strain 
294/pFIFtrpS9 extract 


Human amnion 


20,000 


10,000 


1280 


Bovine kidney 


13,000 


400 


40 



LeIF and FIF were HI H standard solutions haying 20,000 units/ml and 
10,000 units/ml respectively. Assays were performed as described in 
Materials and Methods. 
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Table 3 . Comparison of activities of extracts from E. col i K-12 

strain 294/pFIFtrp69 with standard human leukocyte and 
fibroblast interferons 



Interferon Activity (units/ml) 
LeIF 



FIF 



E. coli K-12 strain 
?947pTTFtrp69 



untreated 


1000 


1000 


1000 


pH2 


1000 


1000 


1000 


rabbit antihuman 


<16 


1000 


1000 


LeIF antibodies 
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of specific chromosomal regions, including gene dusters encompassing neuron-specific 
genes, some of which do not themselves contain REST/NRSF response elements. ^ 

Step-by-Step Assembly 

A family of RNA-protein complexes, known as the uridine-rich small nuclear ribonucleopro- 
teins (U snRNPs), form the core of the spliceosome that excises introns and ligates ^exons to 
form messenger RNA. A functional deficiency in the survival of motor neurons (SMN pro- 
tein results in spinal muscular atrophy, a disease in which motor neurons of the spina cord 
degenerate. Pellizzoni et aL (p. 1775) show that the SMN complex (of which the SMN pro- 
tein is a part) serves to assemble in an orderly fashion the protein ingredients of U snRNPs 
(the Sm proteins) onto the U snRNAs. It first binds the Sm proteins, then the U snRNAs, 
and finally puts them together in an adenosine-triphosphate (ATP-dependent) reaction. 

Neurotrypsin and Mental Retardation 

Inherited mental retardation (MR) is often linked to abnormalities on the X chromo- 
some or to abnormalities in brain development or other clinically identifiable features, 
but in most cases none of these attributes are present. An analysis of such nonsyn- 
dromic MR patients by Molinari ef aL (p. 1779) revealed an association with the muta- 
tion of the serine protease neurotrypsin. In situ hybridization studies of the expression 
of neurotrypsin during normal development revealed that it is expressed in parts of the 
brain associated with learning and memory and first appears at 44 days of develop- 
ment Immunoelectron microscopy localized neurotrypsin at presynaptic nerve endings. 
Although this mutation does not appear to be a common cause of MR, further studies 
may yield insights into the pathways leading to these diseases. 

Neurotoxic, Cytosolic Prion Proteins 

The characteristics of the prion protein, which has been linked to a variety of neurodegen- 
erative disorders, is the subject of two reports. Ma and Lindquist (p. 1785) reveal how inhi- 
bition of the proteasome machinery in cells that produce prion proteins can lead to the ac- 
cumulation of prion isoforms in the cy- 
9t<as j/H«k<! tosol, and how, under certain conditions, 

misfolded self-perpetuating isoforms can 
be generated de novo. Ma ef. aL (p. 1781) 
examined the effects of retrograde- 
transported or cytosolically expressed 
prion protein in a transgenic mouse 
model and in neuronal cell lines. Cytoso- 
lic prion protein was highly neurotoxic, 
and mice engineered to possess cytoso- 
lic prion protein developed severe ataxia, 
cerebellar degeneration, and gliosis. ^ 

The Hydrogen Economy of the Cut 

A major cause of gastritis, peptic ulcers, and certain cancers is the bacterium Helicobac- 
ter pylori. Olson and Maier (p. 1788} show that the colonization success of this com- 
mon pathogen is boosted by hydrogen gas produced by other intestinal occupants. 
Molecular hydrogen in the mucous lining of the stomach stimulates the pathogen to 
produce more of a constitutive enzyme, hydrogenase, required to harvest the energy 
through a series of heme-containing electron carriers. 

Clycosylation in Bacteria 

N-linked glycosylation is a common posttranslational modification of membrane and 
secretory proteins in eukaryotes. However, the only bacterium known to modify pro- 
teins in this way is Campylobacter jejuni, and thus many eukaryotic proteins generated 
in bacterial systems are of limited use because they lack appropriate modifications. 
Wacker et ai. (p. 1790) have transplanted the N-linked glycosylation machinery from C. 
jejuni to Escherichia coti, which should enhance the opportunities for the large-scale 
production of appropriately modified proteins in bioreaetors. 
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Suppression of Human Interferon Production by Inhibitors 

of Glycosylation 
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Department of Microbiology, New York University School of Medicine, New York, New York 10016 

Accepted September 27, 1974 

Treatment of diploid human foreskin cell cultures with either 2-deoxy-D-glucose or 
D-glucosamine resulted in dose -dependent inhibition of interferon production induced with 
polyinosinate-polycytidylate [poly(I) . poly(C)]. This inhibitory action was readily demon- 
strable even if the addition of the inhibitor was delayed until 1.5 hr after induction The 
appearance of intracellular interferon activity was also inhibited, suggesting that synthesis 
of biologically active interferon, and not its secretion from the cell, was the main target of 
the inhibitory action. The inhibitory action of 2-deoxy-D-glucose on interferon production 
was completely reversible by D-mannose while the action of D-glucosamine was only slight y 
affected by the addition of mannose. Interferon produced in the presence of a partially 
inhibitory concentration of 2-deoxy-D-glucose was more heat-labile than control interferon. 
Residual active interferon made in the presence of D-glucosamine was less efficiently 
neutralized by anti-interferon antibody than control interferon. The results are compatible 
with the idea that the two inhibitors suppress the production of biologically active 
interferon by virtue of their interfering with the proper glycosylation of the molecule. 



INTRODUCTION 

The evidence that interferons are glyco- 
proteins has recently been reviewed by 
Weil and Dorner (1973). Schonne et al 
(1970) were the first to show that treatment 
of rabbit interferon preparations with neur- 
aminidase, while not affecting biological 
activity, abolished the characteristic 
charge heterogeneity of interferon demon- 
strable by isoelectric focusing. This finding 
led the authors to propose that rabbit 
interferon is a glycoprotein with a variable 
content of terminal sialic acid residues. 

Dorner et al. (1973) carried this work a 
step further by showing that rabbit inter- 
feron treated with neuraminidase (which 
banded in a single peak at pH 6.3) regained 
its original charge heterogeneity after rein- 
corporation of sialic acid residues by the 
ase of sialyl transferase from rat liver. 
These authors also showed that the carbo- 

1 Permanent address: Institut du Radium, Paris, 
France. Recipient of Eleanor Roosevelt Fellowship 
from International Union Against Cancer. 



hydrate portion of rabbit interferon con- 
tains at least one terminal sequence, sialic 
acid-galactose-. 

The multiplication of several enveloped 
viruses was shown to be inhibited by 
2-deoxy-D-glucose or D-glucosamine. The 
list of viruses shown to be inhibited by one 
or both of these compounds includes influ- 
enza (Kilbourne, 1959; Gandhi et al, 1972; 
Klenk et al, 1972), Sindbis and Semliki 
Forest (Kaluza et al., 1972, 1973), Newcas- 
tle disease (Samson and Fox, 1974; Schol- 
tissek et al, 1974), herpes simplex (Court- 
ney et al, 1973) and avian sarcoma 
(Hunter et al, 1974) viruses. Most availa- 
ble evidence suggests that this suppression 
of virus multiplication is due to selective 
interference with normal glycosylation of 
viral envelope glycoproteins resulting in 
the derangement of their processing. 

These reports prompted us to examine 
the effect of 2-deoxy-D-glucose and D- 
glucosamine on interferon synthesis, to 
determine if, in this instance too, these 



Copyright © 1975 by Academic Press, Inc. 



475 

Board Assigned Page #2209 



JAN 29 2007 15:34 FR CISTI ICIST 



613 952 9303 TO 12122308888 



P. 04/11 



476 



HAVELL et al. 



compounds might exert a selective effect 
on glycosylate, and to obtain new infor- 
mation about the function of the carbohy- 
drate portion of the interferon molecule. 



MATERIALS AND METHODS 
Cell cultures. A human diploid cell 
strain (PS-4), derived in this laboratory 
from the foreskin of a single neonate and 
identified as a high interferon producer 
(Vilcek and Havell, 1973), was used 
throughout- All experiments were carried 
out in cultures grown in 60-mm plastic 
petri dishes, unless specified otherwise. 
Each dish was seeded with a total ol 
200,000 cells in 5 ml of medium consisting 
of 'Eagle's minimal essential medium 
(MEM) buffered with JV-2-hydroxyethyl- 
piperazine - N' - 2 - ethanesulfomc acid 
(hepes, 6 mM), JV-tris-hydroxymethyl- 
methyl glycine (tricine, 13 mM) and so- 
dium bicarbonate (1.1 g/liter), supple- 
mented with 5% heated fetal calf serum. 
The medium was changed for fresh growth 
medium on the 6th day and the cultures 
were used for experiments 11-13 days 'after 
seeding. The incubation was in a humidi- 
fied C0 2 incubator at 36°, unless specified 
otherwise. 

Chemicals . Polyinosinate-polycytidy- 
late [polyd)-poly(C)] was supplied by the 
Antiviral Substances Program, Infectious 
Disease Branch, National Institute of Al- 
lergy and Infectious Diseases, Bethesda, 
MD D-Glucosamine r 2-deoxy-D-glucose, D- 
mannose, and iV-acetyl-D-glucosarmne 
were purchased from Sigma Chemical Co., 
St. Louis, MO. Solutions of the sugars were 
prepared in water, sterilized by membrane 
filtration, and stored at 4°. Cyctoheximide 
was obtained from the Upjohn Co., Kala- 
mazoo, MI, and actinomycin D from Calbi- 
ochem, Los Angeles, CA. 

Interferon titrations. The microassay tor 
human interferon was described in detail 
earlier (Havell and Vilcek, 1972). In short, 
wells of the Micro Test II plastic tissue 
culture plates (Falcon Plastics, Los Ange- 
les, CA, or Linbro, New Haven, CT) were 
filled with 100 fi\ of MEM containing 5% 
fetal calf serum and duplicate serial two- 
fold dilutions of the assayed materials were 
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prepared with the aid of an Eppendorf 
micropipette. To each well was then added 
a suspension of human foreskin cells j 
(30,000 cells/well in 100 til of medium, as ' 
above). After 18-20 hr of incubation, 1,000 
PFU of the Indiana type vesicular stomati- 
tis virus was added to each well and a final 
reading of the test was taken by micro, 
scopic examination about 48 hr after virus 
inoculation. The highest dilution of the 
assayed sample which protected at least', 
50% of the cell sheet from the cytopathic | 
action of the virus was taken as the end 
point. Each assay included the titration of ' 
an interferon standard that had been cali- 
brated against the 69/19 reference standard 
for human interferon (obtained from Anti- 
viral Substances Program, Infectious Dis- j 
ease Branch, National Institute of Allergy 
and Infectious Diseases, Bethesda, MD). 
All interferon titers throughout the paper 
are expressed in terms of reference units/ 

Preparation of high-titered interferon by < 
superinduction. Dish-cultures of FS-4 cells 
were incubated with poly (I) -poly(C) (100 
ug/ml) and cycloheximide (50 /ig/ml) m 2 
ml of serum-free MEM. Actinomycin D 
(final concentration 1 pg/ral) was added 
to the cultures 4 hr after the onset of this 
treatment. At 6 hr (i.e., 2 hr after the 
addition of actinomycin), medium with the 
inducer and the inhibitors was removed, 
the cells were thoroughly washed and re- 
plenished with 2 ml of MEM containing 2% 
fetal calf serum. Culture fluids were har- 
vested 30 hr after induction. The incuba- 
tion temperature throughout this proce-^ 
dure was 34° (Vilcek and Havell, 1973). j 
Preparation of rabbit antiserum against] 
interferon. Interferon prepared in serum- 
free MEM by superinduction of roller- 
bottle cultures of FS-4 cells and concen- 
trated by ultrafiltration was employed ior f 
immunization. Three New Zealand white , 
female rabbits received a total of three t 
subcutaneous injections in Freund's com- 
plete adjuvant, seven injections in, 
Freund's incomplete adjuvant and, finally, 
eight injections without adjuvant. A total , 
of 32 000 units of interferon was adrmnis- ^ 
tered with each injection. Serum was ob -( 
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»nrt n( ;«ined from blood drawn from the ear 
add? fns heated at 56° for 30 min, and ab- 
' V Sed for 1 hr with normal FS-4 cells 
im jfhout 4 x 10 6 cells/ml serum) in order to 
, 1,000 remove antibody against cellular surface 

Xl^^o. aSSay ° f 
micr ! Antiserum. An adaptation of the rmcroas- 
r virus av for human interferon was employed for 
of th he titration of neutralizing activity of 
t W( ^-interferon sera. Micro Test II plastic 
>pathi c (tissue culture plates were first seeded with 
he en A suspension of human foreskin cells 
ition of 25,000 cells/well) in 0.2 ml of] MEM . con- 
mcali- tuning 5% fetal calf serum. After 18 hr 
incubation the medium was aspirated trom 
tn e wells and replaced with 0.2 ml of a 
fixture of serial twofold dilutions of the 
antiserum plus a constant dose of the 
tested interferon sample (10 units/ml, final 
concentration). Prior to adding it to the 
we lls, the mixture of antiserum dilutions 
and interferon had been incubated for 1 hr 
, a t 37° in order to allow neutralization to 
'take place. After 18-20-hr incubation the 
antibody-interferon mixture-containing 
media were aspirated and replaced with 
025 ml of MEM containing 5% fetal calf 
serum and 1,000 PFU of vesicular stomati- 
tis virus. The assay was read 48 hr after 
LW v^jvirus inoculation. The neutralizing titer 
with the |was the highest dilution of the serum that 
smoved, I inhibited the antiviral action of interferon 
and re- as determined by the presence of cyto- 
ining2* pathic effect. Each sample was assayed m 
ere har- duplicate or triplicate, with adequate con- 
incuba- trols for antiserum, interferon and virus, 
s proce- Incorporation of [ Xi C\leucine. FS-4 cul- 
1973). tures in 35-mm plastic petri dishes were 
. against incubated with 0.4 n& of uniformly la- 
i serumlbeled L-[ 14 C]leucine (0.25 mCi/0.12 mg; 
f roller- New England Nuclear Corp., Boston, MA) 
concen- in l ml complete MEM with 2% fetal calf 
loyed for serum. Incorporation was stopped after 60 
id whitejmin by washing the cells three times with 
of three' ice-cold buffered saline. The _ cells ^ were 
d's com- 
.ions in 
I, finally. 
.. A total 
adminis 
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times with 5 ml each of 5% TCA and rinsed 
once with 95% EtOH. Acid-insoluble radio- 
activity was measured in a liquid scintilla- 
tion counter and expressed as cpm/culture. 
Preliminary experiments showed that ex- 
pressing the results as cpm/mg of protein 
did not improve the accuracy of measure- 
ment. 

Treatments with 2-deoxy-D-glucose or 
^-glucosamine. These were always carried 
out in complete MEM, i.e., in the presence 
of 5.5 mM D-glucose. Glucose was retained 
in the medium because it is the major 
carbon and energy source. 

RESULTS 

Dose Response and Kinetics of Inhibition 
Treatment of FS-4 cell cultures with 
either 2-deoxy-D-glucose or D-glucosamine 
resulted in a dose-dependent inhibition of 
the interferon yield, as determined in cul- 
ture fluids collected 6 hr after induction 
with poly(I) poly(C). Both compounds 
were about equally effective in inhibiting 
the production of biologically active inter- 
feron (Fig. 1). . 

Previous studies showed that interteron 
production in this system is sensitive to 
inhibition by actinomycin D only within 
the first hour after induction, indicating 
that the bulk of interferon mRNA synthe- 
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then lysed in 2 ml of 0.1 JV NaOH and 
heated for 30 min at 36°. Lysates were 
made to 10% with trichloroacetic acid 
(TCA); the resulting precipitates were col- 
Lui^xi-Jected on Whatman GF/C glass-fiber filters 
was oH (Reeve Angle, Clifton, NJ), washed four 



FlG. 1. Effect of various concentrations of 2-deoxy- 
D-glucose and D-glucosamine on the interferon yield. 
Cultures of FS-4 cells were stimulated to produce 
interferon by 30-min incubation with poly(I) poly(C) 
(100 Mg/ml) in 2 ml of MEM. The cultures were 
thoroughly washed thereafter and replenished with 2 
ml of MEM plus 2% fetal calf serum containing the 
inhibitors as indicated. All culture fluids were col- 
lected 6 hr after exposure to the inducer, dialyzed and 
assayed for interferon. 
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sis is completed by 1.5 hi (Vilcek and 
Havell, 1973, 1974). When the ime o 
addition of 2-deoxy-D-glucose to cultures 
was delayed until 1.5 hr after exposure of 
cells to poly(I) poly(C) the resulting inhi- 
bition of interferon production w* ^similar 
to that seen in cultures that had been 
treated with the inhibitor from the > time o 
exposure to the inducer (Table 1). Reversal 
of ?he inhibition at 8 hr after induction did 
not result in a subsequent release of appre- 
ciable quantities of interferon. These re- 
sults indicate that 2-deoxy-D-glucose inter- 
feres irreversibly with a posttranscnptional 
event in interferon synthesis. 

Inhibition of Intracellular Appearance of 
Interferon 

In cells stimulated with poly(I) poly(C), 
intracellular interferon is detectable about 
30 min prior to its appearance in the 
extracellular fluid (Tan et ah, 1971; Ng et 
al 1972) The process of interferon release 
is 'temperature dependent and energy de- 
pendent but, unlike interferon synthesis, it 
does not require protein synthesis (Tan et 
al, 1972). 

Treatment with 2-deoxy-D-glucose re- 
duced the amount of interferon recoverable 
from extracts of induced cells (Fig. 

TABLE 1 

Effect of Varying the Time of Treatment with 
2-Deoxy-d-Glucose on the Interferon Yield 



Time of treat- 
ment (hr)" 



None 
0-8 
1.5-8 



Interferon yield at hr* 



8 



24 



768 
64 
96 



96 
16 
12 



a Intracellular i 
o Extracellular S CONTROL 
a Intracellular 1 
* EttracellularJ~ 2 ~ DQ 



Tr 
po 




" All cultures were exposed at 0 hr to poly (I) poly 
(C) {100 Mg/mD in 2 ml of MEM and incubated for 90 
min Thereafter, all cultures were thoroughly washed 
and replenished with 2 ml of MEM containing 2% 
fetal calf serum. In addition, cultures were treated 
with 2-deoxy-D-glucose (6 mM) for the times indi- 

cated. . , .. 

o Culture fluids were collected 8 hr after induction, 
the cultures were washed, replenished with fresh 
maintenance medium (2 ml of MEM with 2% fetal 
calf serum) and incubated until 24 hr. The harvested 
fluids were dialyzed prior to assaying for interferon. 



Fig 2 Effect of 2-deoxy-D-glucose (2-DG) on the 
intracellular appearance of interferon and its secre- , 
tion Cultures were induced with poly(I) poly(C) for 1 
30 min as described in Fig. 1. One group of cultures 
was then replenished with 2 ml of medium containing 
2-DG (10 mM) while the other group receded mainte- 
nance medium. For the determination of intracellular 
interferon levels, media were removed from the cul- 
tures at the indicated intervals, the cell sheet was 
washed twice with ice-cold saline and subjected to 
five cycles of rapid freezing (using a dry 'ce-alcohol 
bl) and thawing at 37= in 2 ml of MEM with 2% 
fetal calf serum. The homogenates were dialyzed pnor 
to interferon assay. For the determination of extracel- 
lular interferon levels, medium from each of the same 
two treated and control plates was collected at 1, 2, 4 
4 5 and 7 hr. After collecting the medium, the celis 
were quickly washed once with warm (37°) medium, 
replenished with 2 ml of fresh warm maintenance 
medium (with or without 2-DG added) and immedi- 
ately returned to the incubator until the next interval. 
All collected fluids were dialyzed prior to interferon 
assay. Since all preceding culture fluid samples had 
been collected at hourly intervals, the 7-hr extraceliu- 1 
lar interferon titers were divided by 2 to correct for the • 
longer interval that had elapsed before the last 
harvesting period. Thus all extracellular interferon , 
titers actually represent the cumulative yield* 
whereas intracellular interferon titers represent the 
net amount of interferon recovered at the particular 
interval from the cell extracts, after dilution of the 



I 



interval num > 

extracts in the 2 ml of medium added before freeze- > 

thawing. 
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Tbus the inhibitory action of this com- 
nound could not be attributed simply to an 
interference with the release process. How- 
pver in addition to the reduction m the 
titer of intracellular interferon, treatment 
w ith 2-deoxy-D-glucose also caused a slight 
delay in the extracellular appearance of the 
interferon that escaped intracellular inhi- 
bition, suggesting some disturbance of the 
release mechanism. Note that at 2 hr the 
ra tio intracellular/extracellular interferon 
is 2 in the control cultures compared to 10 
in the treated group. 

Treatment with D-glucosamine also in- 
hibited the appearance of both intracellu- 
lar and extracellular interferon, whereas 
tf-acetylglucosamine exerted no signifi- 
j ca nt effect on either intracellular or extra- 
cellular interferon titers (Table 2). 

Reversal of the Inhibitory Effect of 
2-Deoxy-n-Glucose by Mannose 

Kaluza et al. (1973) showed that the addi- 
tion of D-mannose specifically interfered 
with the inhibitory action of 2-deoxy-D- 
glucose on the synthesis of infectious Sem- 
liki Forest virus. Earlier, Kaluza et al. 
(1972) demonstrated that a number of 
sugars, including mannose, did not pro- 



TABLE 2 

Effects of d-Glucosamine or N-Acetylglucosamine 
on the Intracellular and Extracellular 
Appearance of Interferon 



Treatment 0 




Interferon yield 




Intra- 


Extracellular at hr 6 




cellular" 








3 


4 6 


None 


64 


1,536 


384 128 


D-glucosamine 


12 


128 


64 64 


N-acetylgluco- 


64 


1,024 


384 96 


samine 









" Cultures were incubated with poly(I)poly(C) for 
30 min as described in Fig. 1. Maintenance medium 
contained D-glucosamine (46 mM), N-acetylglucosa- 
mine (45 mM), or no addition throughout the rest of 
the experiment. 

" Determined in cell extracts prepared at 3 hr after 
induction, as described in Fig. 2. 

c See Fig. 2. All results are actual interferon titers 
of the samples not corrected to yields/hr. 



duce a reversal of the inhibitory effect of 
D-glucosamine on the multiplication of en- 
veloped RNA viruses. 

At concentrations of 5 mM, 2-deoxy-D- 
glucose and D-glucosamine inhibited 
[ 14 CJleucine incorporation by 60 and 46%, 
respectively (Fig. 3). [In other experi- 
ments, a similar degree of inhibition ot 
protein synthesis by these compounds was 
seen in cells that had also been induced 
with poly(I) -poly(C).] At the same concen- 
trations both of these compounds inhibi- 
ted the interferon yield by 90%. The ad- 
dition of mannose afforded a partial 
reversal of the inhibition of protein synthe- 
sis by 2-deoxy-D-glucose, while not affect- 
ing ( 14 C]leucine incorporation in the pres- 
ence of glucosamine. Even at the lowest 
concentration employed, mannose caused 
a complete reversal of the inhibitory effect 
of 2-deoxy-D-glucose on interferon produc- 
tion, while exerting a much less pro- 
nounced reversal of the action of glucosa- 
mine. Increasing the concentration of glu- 
cose in the medium caused only partial 
reversal of the inhibitory effect of 2-deoxy- 
D-glucose on interferon production (data 
not shown here). 

These results are in general agreement 
with those of Kaluza et al. (1973) who 
showed that, although the cellular uptake 
of 2-deoxy-D-glucose was more strongly 
inhibited by excess glucose than by man- 
nose, mannose had a more pronounced 
inhibitory effect on the incorporation of 
labeled 2-deoxy-D-glucose into acid-insolu- 
ble material. They proposed that 2-deoxy- 
D-glucose exerts its inhibitory effect on the 
multiplication of enveloped viruses mainly 
by being incorporated into viral glyco- 
proteins instead of mannose. 

The results of this experiment also show 
the disparity between the effect of these 
compounds on [ 14 C]leucine incorporation 
and interferon production. In the absence 
of mannose, amino acid incorporation was 
inhibited less markedly than interferon 
production. However, in the presence of 
various concentrations of mannose, 
2-deoxy-D-glucose still afforded a 33-23% 
inhibition of cellular protein synthesis 
while interferon production was increased 
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Fig 3 Comparison of the effects of 2-deoxy-D- 
A and ^cosarnine on l^^^ 
Ln and interferon ^^^^ 
n*C]leucine incorporation. Cultures we 
bated for 2 hr with 2-deoxy-o-glucose (5 ™M) Sluco 
lineTs mm or no inhibitor plus the concentration 
samme (5 mM) or n consl sted 
ofmannose indicated. 1 he culture , 14CUeu _ 
of complete MEM with 2% fetal calf serum I C]leu 
m 4 Id/culture) was added to the cultures at 2 
cine (0.4 jiOi/cuiunt;/ y» loKplins was 

u n ft a r tho onset of treatment. The laDenng ^ 
allowed to P r c d L the presence of the inhibitors 

mannose for 1 hr. 
irv expressed as cpm/culture, was determined as 
ScSTta Material and Method, ™™£ 
Stimulation of c~* V^J^^ 

ZZ^^^ « 
Ze (5 mM) or no inhibitor plus the mdicated 
rit^of mannose. Culture fluids har- 
vested at 24 hr, dialyzed and assayed for interferon. 



feron made in the presence of a partially 
inhibitory concentration of one of these 
drugs would show altered thermal stabil j , 
L This consideration rests on the a S - , 
sumption that a minor change » m carbohy. ) * 
Sate content would tend to affect thermal ; 
stability rather than biological activity. I 

To test this idea, a group of cultures wa 
induced with P oly(D-poly(C ^d °^half 
of the cultures was treated with 2-deoxy-D- 
Icos In order to increase the yield of ] % 
interferon, both sets of cultures ™ 
Superinduced'' by sequential treatment 
^ cycloheximide and v**!*^" 
described (Vilcek and Havell 1973). Al- 
though the yield of £ 
2-deoxy-D-glucose-treated cultures was 

only 17% of the yield from control cultures, 
tl activity (7,168 units/ml) was* 
ciently high for the analysis of thermal 
nactivation kinetics at 56°. Since the rate 
of thermal inactivation of interferons is 
knownTo be affected by changes in hydro- 
eenTon concentration (Marshall et al 
1972) both preparations were irst dialyzed 0 ne 
against phosphate-buffered saline, P H 7.1 | ,6 * 

A comparison of the two curves (Fig. 4) 
showsThat inactivation of the interferon 
made in the presence of 2-deoxy-D-glucose 
proceeded at a somewhat faster rate. The 
re' oducibility of this finding was con- 
futed in three separate experiments em- 
S two different batches of interferon 
from control and 2-deoxy-D-glucose-treated 

CUl The eS availability of interferon antisera 
made it possible to compare ^v**^* 
tion of control interferon and mt ertewo 
made in the presence of the inhibitors. The | o 
^utralizing'titer against interferon pro-| . 
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even above control levels, t was ^demon- 
strated earlier than partial inhibition of 
protein synthesis may augment interferon 
? eld presumably through Preferential in- 
hibition of synthesis of a posttranscrip- 

The S^trr^ * 5 

be responsible for the characteristic thai- :Tabte JM inhib i t0 r y effect » w 

production. The neutral 

&^«ff=^ ^-^cantWdifferentfro., - 
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MINUTES AT 56° 
Fig. 4. Comparison of the rate of inactivation at 
56° of interferon produced in the presence of 2-deoxy- 
D -glucose (2-DG) and control interferon. High-titered 
interferon was prepared in FS-4 cells superinduced by 
sequential treatment with cycloheximide and ac- 
\ tinomycin D as described in Materials and Methods. 



irst dialyzed J One-half of the cultures was also incubated with 2-DG 
ine, pH7.4. 
rves (Fig. 4) 
te interferon 
xy-D-glucose 
•,er rate. The 
\g was con- 



(6 mAf) during the entire period of interferon produc- 
tion. Both the control interferon and 2-DG interferon 
were dialyzed against two changes of phosphate-buff- 
ered saline, pH 7.4. Aliquots of both preparations 
were heated in a 56° -water bath. Samples were taken 
at the indicated intervals, diluted in ice cold medium 
-p and assayed for interferon. The initial interferon 

rimentS, em- | of the un heated preparations were 40,960 

if interferons (contro i) and 7,168 (2-DG). 
icose-treated 

its titer against control interferon (data not 
?ron antisera shown here). 

ie neutraliza- The decreased neutraliz ability of inter - 
id interferon feron from glucosamine-treated cultures 
hibitors. The could be the result of one of the following 
terferon pro- two events: either the biologically active 
•tially inhibi- interferon in these preparations is altered 
:osamine was structurally and therefore has a lower affin- 
than against ity for antibody, or, alternatively, these 
s produced in interferon preparations contain an excess 
;ultures or inl of interferon molecules that, while biologi- 
rlglucosamine.cally inactive, are antigenic and thus capa- 
that iV-acetyl- hie 0 f competing for antibody binding sites 
itory effect on 
neutralizes 
against inter- 
of 2-deoxy-D- 
different from 



glucose and D-glucosamine could be the 
result of (1) interference with the energy 
supply of the cell, (2) some secondary effect 
due to depression in overall protein synthe- 
sis, or (3) inhibition of glycosylate or 
improper glycosylation of the interferon 
polypeptide. 

Both D-glucosamine and 2-deoxy-p- 
glucose were shown to cause a decrease in 
the intracellular pool of several nucleotides 
(Letnansky, 1964; Bekesi and Winzler, 
1969; Keppler et al., 1970; Scholtissek, 
1971). Although this action would be ex- 
pected to result in a decrease of cellular 
energy supply, available results do not 
support the view that the effects of the 
antimetabolites can be explained simply 
by lack of energy. In chick cells treated 
with 2-deoxy-D-glucose, Kaluza et al. 
(1973) found no decrease in the energy 
charge of the adenylate pool. Our finding 
that [ 14 C]leucine incorporation into total 
cellular protein was decreased by only 
about 50% is also not consistent with such 
an interpretation. Even prolonged treat- 
ment of cultures with the antimetabolites 

TABLE 3 

Neutralization of Interferon Made in the 
Presence of d-Glucosamine by Anti-Interferon 
Antibody 



with biologically active interferon. 

DISCUSSION 

The inhibition of the production of bio- 
logically active interferon by 2-deoxy-D- 



Interferon preparation 0 


Neutralizing titer of 


antiserum* in Expt. no. 




1 


2 


3 


Control 


560 


535 


819 


D-glucosamine 


■66 


120 


205 


JV-acetylglucosamine 


860 


ND C 


ND 



"Produced as described in Fig. 1, except that 
culture fluids were collected at 3 hr after induction 
with poly(I) poly(C). The preparations used in Expt. 
no. 1 and 2 were produced in media that contained 
D-glucosamine (4.6 mM), iV-acetylglucosamine (4.5 
mM) or no addition and the respective interferon 
titers of the undiluted preparations were 128, 1,536 
and 768 units/ml. The preparations used in Expt. no. 
3 were the 3 hr extracellular fluids listed in Table 2. 
All samples were dialyzed against two changes of 
phosphate-buffered saline and one change of MEM 
before use in the neutralization assay. 

6 Against 10 units of interferon/ml as described in 
Materials and Methods. 

0 Not done. 
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was not toxic and the cells continued to 
3hX albeit at a reduced rate (unpub- 

^hough both compounds caused a de^ 
teclaWe fnhibition oi ; labeled ^ 
incorporation, this reduction in total cellu 
la ^ protein synthesis could hardly account 
for ?he inhibitory effect on interfere. i pro- 
duction. The strongest arguments , aga* t 
such interpretation come from the result 
of the experiment shown m Fig. 3, demon 
strating that the degree of inh bit on of 
overall protein synthesis is not directly 
related to the magnitude of the interferon 

re The Se specific nature of the inhibitory 
effect is also strongly supported by the 
findLg that the addition of as httle as 0 5 
mM mannose completely reversed the in- 
hibitory action on interferon Production. of 
a tenfold excess of MW^JJ ^s 
supports the view expressed earlier that 
2-deoxy-D-glucose acts as an antimetabo- 
lite Tmannose and that it is perhaps 
incorporated into glycoproteins instead of 
mannose (Kaluza et al, 1973). 

Sly two direct lines of evidence sug- 
gest that treatment with both compounds 
Say result in the synthesis of structurally 
Xred interferon molecules: the decreased 
thermal stability of interferon made m the 
presence of 2-deoxy-D-glucose and the ap- 
parent lower affinity for antibody of inter- 
feron made in cells treated with ^-glucosa- 
mine. Although direct proof is lacking it is 
Certainly conceivable that these altered 
ThiactLtics are the ™^f^£ e 
complete or improper glycosylate ot the 
interferon molecule. 

As with most other glycoproteins (Spiro, 
1970 1973), the functional role ot the 
carbohydrate moiety of interferons rem ams 
obscure. Two hypotheses have attempted 
to account for the general significance ot 
the carbohydrate portions of glycoproteins. 
Eylar (1966) proposed that proper glycosy- 
late may be the chemical passport _ for 
export from the synthesizing cell. The idea 
that the carbohydrate moiety is a prerequi- 
site for secretion was challenged by Wm- 
terburn and Phelps (1972) who pointed out 
that many secreted proteins (e.g . serum 
albumin) are not glycosylated. Instead, the 
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latter authors proposed that the sugar 
portion may be important in determining 
?he extracellular fate and function of the 
molecule, such as the recognition of target 
cells It should be pointed out that the two 
hypotheses are not mutually exclusive. 

The inhibitory effect of 2-deoxy-D- 
glucose and glucosamine on interferon pro- 
duction can not be fully explained by 
selective inhibition of the secretion process 
alone, since intracellular levels of inter- 
feron were also suppressed. However he 
results shown in Fig. 2 do not rule out the 
possibility that a lack of proper glycosyla- 
te may partially impede the release of 
interferon from producing cells. 

It remains to be explained why proper 
glycosylation may be required for the syn- 
desis of biologically active interferon. Al- 
though it seems attractive to speculate 
that the carbohydrate portion is important 
in the interaction of interferon with a 
cellular receptor site on target cells, this 
interpretation appears ^likely in view o 
two findings: it was shown that rabbit 
interferon retains full bio ogical activity 
after removal of sialic acid (Schonne et d 
1970; Dorner et al, 1973), and enzymatic 
removal of about 50% of total carbohydrate 
from a preparation of human interferon 
also failed to reduce biological activity (L. | 
B. Anfinsen, personal communication) Al- 
though these treatments resulted only m | 
S removal of carbohydrate it seems 
likely that if indeed the carbohydrate por- 
tion determines the fraction with 
cellular receptor site, the specificity of this 
reaction would depend on the expo d 
terminal carbohydrate residues. It £po«- 
ble however, that some parts of the core 
portion of the carbohydrate chains may be 
important for conformational stability oi 
the interferon molecule which in turn, 
might be required for full biological activ- 

lty Alternatively, a defect in glycosylation' 
might interfere with the proper processm 
of the interferon molecule by the producin 
cell in a way similar to that observed * 
cells infected with fowl plague virus. In the 
latter system, D-glucosamine and 2-deoxy- 
D-glucose were shown to inhibit viral M l 
magglutinin and neuraminidase synthesis 
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tf hile leading to the accumulation of a 
c arbohydrate-free polypeptide that ap- 
peared to be an inactive precursor of the 
viral glycoproteins (Klenk et al, 1972). 
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Human leukocyte and tritium-labeled fibroblast interferons, prepared by induction with Sendai virus 
and with double-stranded polyinosinic acid • poiycytidylic acid respectively, have been studied in relation 
to the carbohydrate moieties attached to them. These interferons were partially purified by immunoab- 
sorbance and by gel filtration. On treatment with glycosidases. about 80''i of the 3 H-tabeled sugar 
moieties in this glycoprotein -containing fraction was removed without detectable alteration of the 
antiviral activity or antibody-binding properties characteristic of interferon. The molecular weight of 
leukocyte interferon was reduced by about 4000. As others have reported, the heterogeneous character of 
interferon revealed by isoelectric focusing was greatly reduced by the enzyme treatment. 
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The study of the purification and properties of interferon has 
attracted much attention because of the potentially valuable 
antiviral activity of this substance. It has not yet been possible 
to produce interferon in large quantities by tissue culture. 
Consequently, many investigators have focused either on its 
isolation in pure form, for the purpose of determination of 
structure and. possibly, for future chemical synthesis, or on the 
development of potent, nontoxic inducers to stimulate endoge- 
nous interferon production. 

Schonne et al. (1) observed a decreased charge heterogeneity 
of rabbit interferon upon isoelectric focusing following treat- 
ment with neuraminidase. They proposed that interferon is a 
glycoprotein, possessing variable amounts of sialic acid. Later, 
this hypothesis was confirmed by Dorner et al. (2). Rabbit 
interferon, upon treatment with neuraminidase, exhibited a 
single peak focusing at pH 6.3. This asialo-interferon regained 
its original heterogeneous character after reincorporation of 
sialic acid residues by the use of sialykransferase. Also, the 
terminal sequence of the carbohydrate moiety was identified as 
sialic acid -• galactose. 

The studies of Schonne et at. (1) and of Dorner et al. i2), 
referred to above, have shown that terminal sialic residues may 
be removed without loss of antiviral .activity. In the case of the 
latter studies (2), the next carbohydrate residue in the polysac- 
charide chain, galactose, could be successively oxidized with 
galactose oxidase and reduced with sodium borohydride with- 
out loss of activity. Since most glycoproteins studied to date 
contain one or more polysaccharide chains with molecular 
weights of 1000 to :?00(), the present studies were carried out to 
examine ihe effects of more extensive carbohydrate removal. 

Such studies are of importance in connection with the 
feasibility of eventual synthesis. This paper summarizes exper- 
iments on human leukocyte and fibroblast interferon that 

•Recipient of a fellowship from "Stiftung fur Stipendien aul dem 
Gebiete der Ohemie," Basel, Switzerland. 



indicate the dispensability of the bulk of the carbohydrate 
component. 

MATERIALS AND METHODS 

Human leukocyte interferon prepared from human leukocytes 
induced with Sendai virus by H. Strander and K. Oamell (.'() and 
Smith. Kline and French Laboratories was supplied tei us by the 
Antiviral Substances Program or the National Institute tor Allergy and 
Infectious Diseases. 

Fibroblast interferon was prepared with the kind cooperation »i J. 
Vilcek and E. Havell at New York University, using the FS-4 strain of 
human foreskin diploid fibroblasts induced with a double-stranded 
complex of polyinosinic acid -poiycytidylic acid (4). Tritiated glyco- 
sides were incorporated into interferon preparations by adding D- 
( 3 H (glucosamine-hydrochloride with a specific activity of 10.13 Ci/mM 
(New England Nuclear. 5 mCi/100 mi of culture fluid) to Eagle's 
minimal essential medium with a glucose concentration lowered to 20 
mg/100 mi. The tissue culture fluids were first desalted by passing 
them through a column of Sephadex (i-25 medium (5.'2 x 7r> cm) in 0.1 
M acetic acid tn remove ihe bulk of the radioactivity. 

The s H-labeled protein fraction from fibroblast cultures was purified 
by immunoabsorbance chromatography (5. 6) and further by gel 
filtration on Sephadex (MOO (fine 2.6 x 90 cm). Human leukocyte 
interferon was subjected to the same purification procedures, interfer- 
ons were eluted from the affinity column by a pHgradient (6} 
containing cytochrome c 10.5 mg/ml) and from G-100 columns by 0. 1 M 
acetic acid containing NaCl (0.15 mi and cytochrome c (0.5 mg/mJi. 
The partially purified interferons were diaiyzed against 0.1 m acetic 
acid and lyophilized. 

Interferon assays were carried out by a modification of the micro 
method originally described for rabhit kidney cells (7). The reference 
human interferon preparation. 69/19, titers 5 a !0 ! units/'ml in this 
assay. The specific activities nf partially purified interferons have not 
been estimated in the current studies. As discussed earlier (fiJ, a large 
purification (20(1 1<> DOO-fold) is obtained in the first step (immunoab- 
sorbance on antibody-Sepharose columns). Indeed, the amounts ot 
protein present in the interferon fraction is sufficiently low that it is 
necessary to add a protective protein (in our experiments, cytochrome 
ci to prevent losses due to adsorption in subsequent steps. The current 
studies are concerned with the physical behavior before and alter 
removal of carbohydrates. The changes in charge and mass have been 
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related to the migration of material possessing antiviral activity during 
isoelectric focusing and gel filtration. 

The mixture of glycosidases used to partially remove the carbohy- 
drate moiety from interferon was prepared (8) from a strain of 
Diplococcus pneumoniae. Type 1, with the kind cooperation of G, 
Ashwell. National Institute of Arthritis, Metabolism, and Digestive 
Diseases. a-Mannosidase (49 units/ml) was a gift from Dr. Ashwell. 
The glycosidase preparation (8, 9) contained £-galactosidase (4.6 
units/ml), JV-acetylglucosaminidase (10.4 units/ml), neuraminidase 
(2.1 units/ml), and an undetermined amount of endoglycosidase (10). 
The protein concentration of this preparation was 3.9 mg/ml. 

Treatment of 'H-tabeted Interferon with Glycosidases— The par- 
tially purified and lyophilized interferon was redissolved in Mcllvaine 
phosphate/citrate buffer, pH 6.0, and the initial radioactivity was 
determined. Solutions of glycosidases (2 jil/1000 units of interferon) 
and a- man nosidase (0.25 pI/lQOO units of interferon) were added and 
the mixture was incubated at 37". AUquots (0,2 ml) were taken during 
the course of the enzymic reaction for interferon assay. For the 
determination of sugar released, aliquots (0.1 ml) were removed and 
precipitated by addition of 0.2 ml of 2% phosphotungstic acid in 0.5 pi 
HC1. After centrifugation, a measured amount of the supernatant was 
counted in a liquid scintillation counter (Table 1). (Leukocyte inter- 
feron was also treated with glycosidases and a-mannosidase under the 
same conditions as described above; see Figs. 1 and 2.) 

For experiments designed to identify the 'H-labeled sugars removed 
by enzyme treatment, the interferon was dissolved in deionized water 
rather than buffer. After 3 hours incubation with glycosidases alone (2 
>il/1000 units of interferon), the reaction mixture was passed through a 
column (1.6 x 28 cm) of Sephadex G-25 medium equilibrated with 0.1 
m acetic acid. The fractions containing the low molecular weight 
material were lyophilized and redissolved in 0.5 ml of water. A 0.1 -ml 
aliquot was applied together with a carrier mixture of mannose, 
galactose, sialic acid, and JV-acetylglucosamine (1 fig each) on a 
Whatman No. 1MM paper. The same mixture was also applied for 
reference and the paper was subjected to descending chromatography 
using as solvent system, pyridine/acetic acid/ethyl acetate/water 
(5/1/5/3, v/v). The reference strip was developed with silver nitrate- 
staining solution (11) and the corresponding areas of the strip with the 
radioactive sample were cut out and counted in a liquid scintillation 
counter. 

Antibody- binding Experiments— Antibodies to human leukocyte 
interferon, prepared in sheep, were purified and bound toSepharose as 
described previously (6). Leukocyte and fibroblast interferons, after 
treatment with enzymes, were passed through a column (1x5 cm) 
prepared from the Sepharose-bound antibodies. The column was 
washed with phosphate-buffered saline and eluted at pH 2.2 with 
Sorensen's citrate buffer containing cytochrome c (0.5 mg/ml). 

Isoelectric Focusing— Leukocyte interferon (6.7 x 10* units), before 
and after treatment with glycosidases and a-mannosidase for 3 hours 
as mentioned above for fibroblast material, was subjected to isoelectric 
focusing at 4° using pH 3.5 to 8.0 Ampholines (LKB, Bromma, 
Sweden) and a 25-ml sucrose gradient of 0 to 50% (Fig. 1) . After 5 hours 
at 310 volts, 0.5-ml fractions were collected and their pH was 
measured. Fractions were assayed for interferon activity. 

Molecular Weight Determination — The apparent molecular weight 
of human leukocyte interferon was estimated by gel filtration using a 
Sephadex G-100 (fine) column (1.5 x 100 cm) equilibrated with 0.1 u 
acetic acid containing 0.15 m NaCI. Ovalbumin, chymotrypsinogen, 
and bovine pancreatic ribonuclease were added as markers and 
dextran blue was used to determine the void volume (elution followed 
at 280 nm on LKB Uvicord). The flow rate was maintained at 10 
ml/hour and 1-ml fractions were collected. The eluates were examined 
at 280 nm for protein concentration, and assayed for interferon and 
ribonuclease (12) activities. The determination was carried out on 
leukocyte interferon (10* units) both before and after treatment with 
glycosidases and a-mannosidase, 

RESULTS AND DISCUSSION 

As can be seen in Table I, incubation of the protein fraction 
containing fibroblast interferon labeled with tritiated sugar 
moieties with a mixture of glycosidases and a-mannosidase 
results in the progressive removal of tritiated sugar residues. 
Between 74 to 85% of the tritium label was removed without 
loss of interferon activity in three experiments carried out on 
different preparations of tritiated interferon. Complete re- 




10 15 20 25 30 
FRACTIONS 

Fig. 1. Isoelectric focusing of human interferon before (a) and after 
(6) removal of the carbohydrate moiety by treatment with glycosidases 
for 3 hours as described under "Materials and Methods." Human 
interferon (leukocyte, 6.7 x 10* units in each case) was focused at pH 

3.5 to 8 in a 0 to 50% sucrose gradient. •, pH profile; O O, 

interferon activity. 

moval of the tritiated sugars was not achieved, even with 
longer enzyme incubation times. This could be due to the fact 
that part of the JV-acetylglucosamine residues are directly 
connected to the polypeptide chain of the glycoprotein and 
that the sugar-amino acid bond cannot be enzymatically 
cleaved. However, the fact that the molecular weight (Fig, 2) of 
leukocyte interferon is considerably decreased by glycosidase 
treatment indicates that a large part of the sugar attached to 
interferon has been removed. 

The tritiated sugars cleaved from the interferon-containing 
glycoprotein fraction by treatment with glycosidases alone 
were separated by paper chromatography. Of the total radioac- 
tivity applied to the paper (572 dpm), 124 dpm were recovered 
in the spot corresponding to sialic acid (R F = 0.50) and 348 
dpm as JV-acetylglucosamine (R F = 0.69). No significant 
radioactivity was detected in any other part of the paper strip. 
Thus, some conversion of the glucosamine, added to the 
culture medium during interferon production, to sialic acid by 
condensation with phosphoenolpyruvate and to JV-acetyl- 
glucosamine by reaction with acetyl-CoA had taken place. 
Conversion of glucosamine to other sugars commonly occurring 
in glycoproteins would not be expected under our experimental 
conditions. 

Treatment of fibroblast interferon, prepared in the presence 
of tritiated glucosamine, with neuraminidase alone resulted in 
the removal of 25 to 30** of the tritium label and in this case 
the only product detected was (*H Jsialic acid. This agrees with 
the findings of Dorner et al. (2) for rabbit interferon, namely 
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Table I 

Emymk cleavage of tritiated sugar moieties from human fibroblast 
interferon 



Interferon 
preparation" 



Table Fl 

tmmunoabsorbance chromatography of human leukocyte and 
fibroblast interferons before and after treatment with glycosidases and 
a-mannosidase 



111 



Intubation 
time 


Amount nf 
'H label in 
supernatant 


n lanei 
released 


InTvpfOFMn 
lil Hrrnrimi 

activity 






% 


units/ml 




X ?0 ' 


u 




1.2 


25 


10 


5.68 


16.3 




20 


10.96 


31.5 


16 


40 


17.98 


51.7 




60 


20.26 


58.3 


25 


80 


23.42 


67.4 




120 


25.88 


74.5 


25 


160 


27.38 


78.8 




240 


28.07 


80.7 


25 


0 


0.3 


0.1 


3.2 


180 


392 


73.4 


3.2 


0 


0.2 


0.2 


5.0 


180 


94.6 


84.7 


5.0 



* Interferon Preparations I, II, and III contained 3.48 x 10\ 5.34 x 
10 ! , and 1.12 x 10 s dpm/ml, respectively. 

"In this experiment, a 3-fold excess of unlabeled fibroblast inter- 
feron was added to the labeled material. 




« 60 SO 100 120 

ELUTION VOLUME (ml) 

Fig. 2. Apparent molecular weight determination by Sephadex 
(G-100, fine) gel filtration of human interferon before and after 
cleavage of the sugar moiety. The column (1,5 x 100 cm), equilibrated 
and eluted with 0.1 m acetic acid/0.15 M Nad, was used for the 
separation of: first, dextran blue, ovalbumin, chymotrypfsinogen, and 
ribonuclease; second, ovalbumin, ribonuclease, and interferon (leuko- 
cyte; 10* units); third, ovalbumin, ribonuclease, and interferon treated 
with glycosidases and a-mannosidase as described under "Materials 
and Methods." The A a* etution profiles of the marker proteins of the 
three column runs were completely indistinguishable when they were 

superimposed; 1-ml fractions were collected. ultraviolet profile at 

280 nm of markers; , interferon activity of (O) untreated and (•) 

enzyme- treated interferon. The experimentally determined values for 
interferon activity were fitted by Gaussian curves, assuming each point 
to have the same statistical value. 

that sialic acid is present as the terminal residue in the 
carbohydrate moiety of this material. 

Neither leukocyte nor fibroblast interferons showed any 
change in binding to the a nti- interferon antibody column as a 
result of treatment with glycosidases and a-mannosidase 
(Table II) under the conditions described under "Materials 
and Methods." In one experiment (No, IB, Table II), the 
enzyme-treated interferon solution (4.4 x 10' dpm) was 
applied to this affinity column. Most of the radioactivity (3.6 x 
10* dpm; &l"<) passed through the column unretarded, while 
0.84 x 10* dpm (19%) were eluted with citrate buffer <pH 2.2) 
along with the interferon activity. These figures are in agree- 
ment with those from the experiments described in Table 1. 



Experi- 
ment No. 


Type of interferon 


Enzyme 
treat- 
ment 


Total 
units 
applied, 

* 10-' 


Units bound 
to anti- 
interferon 
column, 
x 10 * 


Re- 
covery" 












% 


1A 


Fibroblast 




19 


8.5 


45 


IB 


Fibroblast, 're- 


+ 


9.9 


4.3 


43 




labeled sugar 










2A 


Leukocyte 




90 


53 


59 


2B 


Leukocyte 


+ 


90 


64 


71 



"Recovery refers to the antibody-bound interferon, eluted with 
citrate buffer, pH 2.2. 
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2.0 
1.8 
14 
1.7 
1.8 
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4.6 4.1 A2 4.3 4.4 4.5 4.6 4.7 4* 
LOGARITHM OF MOLECULAR WEIGHT 

FiC. 3. Graphic determination of the molecular weight of interfer- 
ons before and after enzymic removal of the carbohydrate portions 
(Fig. 2). The ratios of the elution volume of each protein (V„) divided 
by the elution volume of blue dextran (V 0 ) were plotted against the 
logarithm of molecular weights of the protein markers: ovalbumin (4.3 
x 10*), chymotrypsinogen (2.32 x 10*), and ribonuclease (1.37 x 10'). 



This indicates that the removal of much of the carbohydrate 
from interferon does not affect its ability to bind to its specific 
antibodies. 

Isoelectric focusing experiments on both types of interferons 
indicate that after partial removal of the carbohydrate moiety 
from interferon, its heterogeneous character is reduced (Fig. 1). 
The recovery of leukocyte interferon (6.7 x 10' units) on 
isoelectric focusing was 36%, whereas the recovery after en- 
zymic removal of the carbohydrate moiety was 71%. This type 
of heterogeneous behavior on isoelectric focusing was previ- 
ously observed by Schonne et al. (1) using rabbit interferon and 
by Stancek et al. (13) using rabbit, mouse, and human 
interferon. The heterogeneity was reduced after treatment with 
neuraminidase (1, 2). Dorner et al, (2) showed that the 
terminal sugar residue of the carbohydrate moiety was sialic 
acid. 

The results of molecular weight determinations on leukocyte 
interferon before and after treatment with glycosidases are 
shown in Figs. 2 and 3. After enzyme treatment (glycosidases 
arid a-mannosidase), the apparent molecular weight of the 
interferon is decreased by about 4,200. This observation was 
varified in three separate experiments. The molecular weight 
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determined for untreated interferon is 26,000 * 1,200. It should 
be noted, however, that glycoproteins are not believed to 
behave as globular proteins on gel filtration (14). In addition, 
since losses of interferon activity occurred during gel filtration 
in neutral buffers, our columns were eluted under acid condi- 
tions which may cause some protein unfolding. Therefore, we 
stress that the change in molecular weight of interferon is the 
important observation rather than the absolute molecular 
weight values. 
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SUMMARY 

The inhibitors of glycosylation, 2-deoxy>D-glucose or d- 
glucosamine, inhibit the synthesis of biologically active in- 
terferon in human FS-4 fibroblast cultures stimulated with 
polyinosinate-polycytidylate. Interferon synthesized in the 
presence of partially inhibitory concentrations of 2-deoxy-D- 
glucose or D-glucosamine were found to differ from interfer- 
ons made in control cultures in some physical properties. 
Interferons synthesized in the presence of either inhibitor 
had a diminished charge heterogeneity demonstrable by iso- 
electric focusing. On sodium dodecyl sulfate-polyacryl- 
amide gel electrophoresis, control interferon activity 
formed a single peak with the apparent molecular weight of 
20,000, whereas interferons from cultures treated with 
either inhibitor could be resolved into two distinct molecu- 
lar weight components, one of which was smaller than the 
interferon synthesized in control cultures. 

Most interferons are probably glycoproteins. Two groups of 
investigators (1, 2) observed that neuraminidase treatment of 
rabbit interferon resulted in a considerable decrease in the 
charge heterogeneity observed on isoelectric focusing. Dorner 
et al. (2) were able to restore the charge heterogeneity of 
desialylated rabbit interferon by its incubation with JV-acetyl- 
neuraminic acid and sialyltransferase. 

Several lines of evidence suggest that human interferon 
derived from cultures of skin fibroblast cells is also glycosyl- 
ated. Jankowski et al. (3) have shown that human fibroblast 
interferon was capable of binding to several immobilized lec- 
tins specific for AT-acetylneuraminic acid, L-fucose, and D-man- 
nose. Studies on the intracellular site of human fibroblast 
interferon synthesis have shown that it is probably synthe- 
sized in the membranes of the rough endoplasmic reticulum 
(4) and that it may follow the secretory pathway of other 
known glycoproteins (5, 6). The synthesis of interferon is 
inhibited in a dose-dependent manner by the inhibitors of 
glycosylation, 2-deoxy-D-glucose, and o-glucosamine (7). The 
2-deoxy-D-glucose is known to act as an anti-metabolite for 
mannose (8), whereas the precise mechanism of action of r> 
glucosamine remains unknown (9). 

We have extended our studies on the physical nature of the 
interferons synthesized in the presence of partially inhibitory 
concentrations of either 2-deoxy-D-glucose or D-glucosamine. 

* This work was supported by United States Public Health Service 
Grants AI-07057, A!- 12948, and Contract AI-02169 from the Na- 
tional Institute of Allergy and Infectious Diseases. 

t Permanent address, National Institute of Health of Japan, Mu- 
rayama Annex, Nakato, MusaehioiurayBnia, Tokyo, 190-12, Japan. 



In this communication, we report on the effects of these inhibi- 
tors of glycosylation on the molecular weight and isoelectric 
point of the interferons synthesized in cultured human fibro- 
blast cells. 

MATERIALS AND METHODS 

Cultures of the human FS-4 fibroblast strain were grown in l h- 
gallon roller bottles and induced with the synthetic polyribonucleo- 
tide poly (I) ■ poly (C) , 1 according to the method described earlier (10). 
Cultures were incubated for 6 h with polyd)-poly(C) (5 uglml, P-L 
Biochemicals, Milwaukee, Wis.) together with cycloheximide (50 fig/ 
ml, Upjohn Co., Kalamazoo, Mich.) in 50 ml of Eagle's Minimal 
Essential Medium (MEM, GIBCO). Actinomycin D (1 fig/ml, Calbi- 
ochem, San Diego, Calif.) was added to the induction medium at 5 h. 
At the end of the 6-h induction period the medium was removed, and 
the cultures extensively washed and then reincubated at 34° for an 
additional 24 h in the interferon production medium which consisted 
of SO ml of MEM buffered with 4-(2-hydrojtyethyl)-l-piperazineeth- 
anesulfonic acid (Hepes, 13 mM), JV-[TrisChydroxymethyD- 
methyllglycine (Tricine, 6 mil), and sodium bicarbonate (1.1 g/iiter), 
supplemented with 0.5% heated fetal calf serum (56° for 30 min). 
Glucosamine or 2-deoxy-D-glucose was added to the production media 
at a final concentration of 10 mM and 5 mM, respectively. These 
concentrations of glucosamine or 2-deoxy-D-glucose inhibited the 
production of biologically active interferon by about 90%, as com- 
pared to inhibitor-free control cultures. 

The interferon activities of the preparations were determined by 
the microtiter assay method as previously reported (8). The inter- 
feron preparations were concentrated approximately 10-fold by di- 
alysis against the hygroscopic agent, Ficoll 400, prior to use in these 
experiments. Following concentration, the interferon titers for the 
control, 2-deoxy-u-glucose, and glucosamine interferons were 
200,000, 25,000, and 6,000 units/ml, respectively. The lower titers of 
the preparations from inhibitor-treated cultures are due to the inhib- 
itory action of 2-deoxy-D-glucose and glucosamine on interferon syn- 
thesis (7). 

RESULTS AND DISCUSSION 

The concentrated interferon preparations obtained from in- 
duced cultures of FS-4 fibroblast cells treated with or without 5 
mM 2-deoxy-D-glucose were analyzed by sodium dodecyl sul- 
fate-polyacrylamide gel electrophoresis (11, 12) to determine 
the molecular weights of each interferon species present in 
either preparation (Fig. 1). The interferon activity found in 
the interferon preparations obtained from the untreated FS-4 
cultures (Control A) migrated as a homogeneous species with 
a molecular weight of 20,000. This molecular weight agrees 
with the value of Knight (13) for purified FS-4 fibroblast 
interferon and with our own earlier reported results (14), 
Sodium dodecyl sulfate-polyacrylamide gel electrophoresis of 
the interferon preparations from induced FS-4 cultures treated 
with 2-deoxy-D-glucose or glucosamine revealed the presence 
of two distinct interferon peaks, with about equal distribution 
of interferon activity in the two peaks. The slower migrating 
interferon species possessed a molecular weight similar to that 
found for the control interferon, whereas the faster migrating 
interferon activity was determined to have a molecular weight 
of 16,000. It should be noted that the lower activities for the 
interferons produced in the presence of either inhibitor of 
glycosylation are due to smaller amounts of interferon placed 
on the sodium dodecyl sulfate gels and not to the less efficient 
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Fig. 1. Sodium dodecyl sulfete-polyacrylamide gel electrophore- 
sis of interferons made in the presence of either 2-deoxy-D-glucose or 
D-glucoBamine. Sodium dodecyl sulfate-polyacrylamide gel electro- 
phoresis was done according to the method of Stewart and Desmyter 
(12). This electrophoresis procedure uses 10% polyacrylamide and 0. 1 
m sodium phosphate buffer, pH 7.4, with 0.1% sodium dodecyl sul- 
fate. Prior to electrophoresis, the samples were incubated at 37° for 1 
h in 1% sodium dodecyl sulfate, then dialyzed in 0.01 m sodium 
phosphate buffer, pH 7.4, with 0.1% sodium dodecyl sulfate. The 
dialyzed samples were then made to 10% in sucrose and electropho- 
reses for 18 h. The current applied was 5 mA/disc get. Lysozyme, 
chymotrypsin, ovalbumin, and bovine serum albumin were used as 
markers for molecular weight determinations. Following electropho- 
resis, the gels were frozen and sliced into 1-mm slices, 2 adjacent 
slices (referred to as one fraction) were pooled, and interferon activ- 
ity was eluted in 1 ml of MEM containing 5% fetal calf serum at room 
temperature overnight. Approximately 50% of the interferon activity 
applied to the sodium dodecyl sulfate gels was recovered in all four 
groups. 

recovery of these interferon species from the sodium dodecyl 
sulfate gels. 

The resulting isoelectric focusing distribution of interferon 
activities for the interferons from untreated and inhibitor- 
treated cultures is shown in Fig. 2. The interferon activity 
present in the control preparation was recovered from two 
regions of the pH gradient. Considerable amounts of inter- 
feron activity were recovered from the top of the gels (pH 4.0). 
This portion of the interferon activity may either not have 
entered the gel, or it may represent interferon noncovalently 
bound to albumin (16) or other proteins with an isoelectric 
point near pH 4.0. The rest of interferon activity was recovered 
from a very broad region centering just past neutrality, with 
the bulk of interferon activity focusing in the pH region 7.2 to 
7.6. This broad heterodisperse activity focusing in the range of 
neutrality was not apparent in either of the two interferon 
preparations prepared in the presence of either 2-deoxy-D- 
glucose or ^glucosamine. Instead, both of these interferon 
preparations focused rather sharply, with most of the inter- 
feron activity found in a single fraction corresponding to a pH 
of 7.4. Considerable differences were noticed in the isoelectric 
focusing patterns of interferon preparations from the two in- 
hibitor-treated FS-4 cultures in the acidic regions of the pH 
gradient. It is of interest that, compared to control interferon, 
less of the 2-deoxy-D-glucose interferon, and even less of the 
interferon made in the presence of D-glucosamine, was re- 
covered from the top of the gradients. Although this result is 
not completely understood, it may indicate that aggregation of 
interferon with albumin or other material is the function of 
the carbohydrate moiety. 

These results supplement our previous observations on the 
inhibitory action of 2-deoxy-D-glucose and D-glucosamine on 
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Fig. 2. Isoelectric focusing of interferons synthesized in the pres- 
ence of either 2-deoxy-D-glucose or D-glucosamine. A modification of 
the isoelectric focusing procedure of Gainer (15) was used to deter- 
mine the isoelectric points (pis) of the interferons. Cylindrical disc 
gels (0.6 x 8 cm) consisted of 1.8 ml of 7.5% polyacrylamide contain- 
ing 2% ampholytes (LKB, Hicksville, New York, pH range 3 .5 to 10). 
Samples to be isoelectrically focused were dialyzed against a 1% 
glycine-distilled H 2 0 solution, made to 10% in sucrose, and 0.2 ml of 
this material was layered on the anode end of the gel. The solutions 
in the electrode vessels were 1% HzSO, (anode) and 2% ethanolamine 
(cathode). Isoelectric focusing was done at 4° over approximately a 5- 
h period, with an initial potential of 150 V. Fractionation and elution 
of interferon was essentially as described for sodium dodecyl sulfate- 
polyacrylamide gels, except elution was done at 4°. Approximately 
25% of the interferon activity applied to the gels containing amphol- 
ytes was recovered in all groups. Determination of the pH gradient 
was done by eluting fractions of a blank gel overnight at 4° in 
distilled H,0 and measuring the pH at 4°. 

the synthesis of human fibroblast interferon (7). They support 
more directly the conclusion that the interferon molecules 
synthesized in the presence of inhibitors are altered. Our 
sodium dodecyl sulfate-polyacrylamide gel electrophoresis re- 
sults suggest that interferons made in the presence of partially 
inhibitory concentrations of the two drugs may be a mixture of 
altered and unaltered (or minimally altered) interferon mole- 
cules. If the differences in the molecular weights of the two 
interferon peaks are determined by the presence or absence of 
carbohydrates, then the carbohydrate would be likely to ac- 
count for at least 20% of the total molecular weight of human 
fibroblast interferon (16,000 versus 20,000 molecular weight). 
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This calculation is in good agreement with an independent 
determination obtained by incubating human interferon with 
various purified glycosidases (17). This treatment resulted in a 
decrease in molecular weight and reduction of the charge 
heterogeneity of the interferon preparation. 
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Contributed by Mark Ptashne, June 26, 1980 

ABSTRACT We applied the method of Guarente et al. 
[Guarente, L,, Lauer, G., Roberts, T. M. & Ptashne, M. (1980) Cell 
20, 543-553] to construct plasmids that direct expression in 
Escherichia coli of the human fibroblast interferon (F-IF) gene. 
Two plasmids were recovered. One directs efficient synthesis 
of a protein whose primary sequence is that of pre-F-IF and the 
other, that of mature F-IF. Extracts of bacteria synthesizing 
mature F-IF display antiviral activity characteristic of human 
F-IF. This activity is lower than that expected from the differ- 
ential rate of synthesis of the protein. We have detected no such 
activity in extracts of bacteria synthesizing pre-F-IF. 

Human fibroblast interferon (F-IF) is a glycoprotein produced 
by human fibroblasts in response to virus and certain polynu- 
cleotides (1, 2). The secreted protein has potent antiviral activity 
that is readily assayed in vitro (2, 3). The sequence of the 
amino-terminal 13 amino acids of F-IF has been reported 
(4). 

A cDNA molecule encoding human F-IF was cloned by 
Taniguchi et al. (5). The DNA sequence of this molecule pre- 
dicts that the secreted F-IF contains 166 amino acids, the first 
13 of which would be identical to the corresponding sequence 
of the protein as determined by Knight et al. (4). Moreover, the 
sequence is consistent with the idea that F-IF is synthesized as 
a precursor (pre-F-IF) with a 21-amino-acid hydrophobic 
leader at its amino terminus (6, 7). 

A series of papers from this laboratory* has developed 
methods to express cloned prokaryotic and eukaryotic genes 
in Escherichia coli (8-11). The protein products of these plas- 
mid-carried genes were produced in their native states — that 
is, unfused to other proteins (8-11). The essential feature of this 
method is to position a "portable promoter" in front of the 
cloned gene so that the gene is efficiently transcribed and the 
resultant mRNA is efficiently translated, beginning at an ini- 
tiation codon — e.g., AUG. This AUG may, but need not be, that 
which directs initiation of synthesis of the native protein in vivo. 
The procedure of Roberts et al. (9, 10) enables us to position the 
portable promoter at various positions in front of the cloned 
gene by using recombination in vitro. The method of Guarente 
et al. (11) exploits lac genetics to identify those positionings that 
direct efficient transcription and translation of the cloned gene. 
This latter procedure eliminates the need for any assay for the 
gene product to identify those bacteria that express the desired 
proteins (see Method of Gene Expression in Results). 

The protein and DNA sequence data referred to above in- 
dicate that both F-IF and pre-F-IF bear methionine residues 
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at their amino termini (4, 6). We describe in this paper the 
application of the method of Guarente et al. (11) to the F-IF 
gene. We describe the construction and identification of plas- 
mids that direct the efficient synthesis of two proteins. The 
primary sequences apparently correspond to the sequence of 
F-IF in one case and to that of pre-F-IF in the other. F-IF 
produced in bacteria prevents viral growth as assayed in 
vitro. 

MATERIALS AND METHODS 

DNA Constructions. All techniques were as described by 
Guarente et al. (11). pTR56 (see Fig. 1) was constructed in two 
steps as follows. First, a plasmid (pLGlll) was constructed that 
bears a fft'ndlll synthetic linker three nucleotides before the 
ATG of pre-F-IF (6). This was accomplished by joining four 
DNA fragments: (i) a BamHl-Pst I backbone fragment from 
pLG300 (11); (if) a Hindll-Bgl II fragment containing the 
entire F-IF coding sequence from TpIF319-13 (6); (Hi) a 
Pst 1-Pvu II fragment frompGLlOl bearing the 5' end of the 
amp gene; and (iv) a HindlH linker. Ligation of these frag- 
ments fuses two complementary sticky ends (Pst 1-Pst I and 
BamHl-Bgl II) and two blunt ends (Pvu II-//indIII linker and 
fftndlll linker-Hindll). The Pst I joining thus reconstitutes 
amp. Second, pTR56 was constructed by joining three frag- 
ments: (i) a Pst l-Pst I fragment from pLGlll bearing the 5' 
portion of amp and the 5' portion of the F-IF gene; (ii) an in- 
ternal fragment of the F-IF gene extending from the Pst I site 
to the first downstream Hinf site (6), the latter having been 
rendered flush by DNA polymerase I (12); and (Hi) a Pst I- 
BamHl fragment from pLG300 (11) that bears the 3' end of 
amp and a 3' fragment of lacZ. The BamHl end of this frag- 
ment had been rendered flush by DNA polymerase I (12). Li- 
gation of these fragments generates two Pst l-Pst I fusions, one 
of which reconstitutes amp, as well as a Bam HI (filled inJ-Hinf 
(filled in) fusion joining the 5' portion of the F-IF gene in phase 
with the 3' portion of lacZ (cf. refs. 6 and 11). 

Radiolabeling of Proteins. The procedure has been de- 
scribed (11). Pulse labeling was with 300 juCi (1 Ci = 3.7 X 10 10 
becquerels) of [^Jmethionine and chasing was achieved by 
adding a 1000-fold excess of unlabeled methionine. This was 
done at 30°C. Labeled extracts were run on 15% acrylamide 
gels for analysis as described (13). 

Preparation of Bacterial Extracts. Extracts were prepared 
essentially as described by Nagata et al. (14) except phenyl- 
methylsulfonyl fluoride and EDTA were added in the lysis 
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buffer at concentrations of 10 Mg/ m ' an ^ 6 mM, respec- 
tively. 

F-IF Assay. F-IF activity was determined by the cytopathic 
effect (CPE)-inhibition method essentially as described (2) with 
international standard F-IF (National Institute of Allergy and 
Infectious Diseases) as reference. 



bacteria were transformed with a mixture of these plasmids, 
and those (rare) transformants expressing high levels of /5-gal- 
actosidase were identified on the appropriate indicator plates. 
Sequence analysis revealed that two such isolates bore the 
portable promoter adjacent to the ATG of pre- F-IF in one case 
(pLG104) and the ATG of mature F-IF in the other (pLGl 17) 



RESULTS 

Method of Gene Expression. We desired to abutt a DNA 
fragment bearing our portable promoter to the ATG encoding 
the amino-terminal methionine of pre- F-IF in one case and to 
the ATG encoding the amino-terminal methionine of mature 
F-IF in another case. This promoter fragment encodes the 
promoter and leader sequence of the lac operon but terminates 
two base pairs before the ATG that encodes the amino-terminal 
methionine of /?-galactosidase (8-10). The mRNA encoded by 
the fragment does contain those bases called the Shine-Dal- 
garno (SD) sequence that, in conjunction with a properly po- 
sitioned AUG, promote ribosome binding and initiation of 
protein synthesis (9-11). We have called the combination of the 
SD sequence of lacZ with the AUG of another gene a "hybrid 
ribosome-binding site," and we have constructed various hybrid 
ribosome-binding sites to express a variety of prokaryotic and 
eukaryotic proteins (8-10). Although it is clear that the SD- 
AUG separation must be small (3-11 bases), we cannot yet 
predict the optimal placement nor do we know the effect of 
base sequence (15). 

Our strategy, in outline, was as follows. Following the strategy 
of Guarente et al. (1 1), we fused a 5' portion of the F-IF gene 
with a large 3' fragment of the lacl gene of E. coli (Fig. 2). This 
lacZ sequence encodes enzymatically active /3-galactosidase, 
and as indicated in the figure, it is fused to a portion of loci. This 
fused gene, carried on a plasmid, was not expressed, however, 
because it lacks transcription and translation initiation se- 
quences. We then opened the plasmid at a unique restriction 
enzyme site 5' to the F-IF gene and resected DNA to various 
extents by using exonuclease (Fig. 2 A and B). This enabled us 
to position the portable promoter at many points near and in 
the 5' region of the F-IF gene (Fig. 2C). Lactose-negative 
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FIG. 1. Construction of pTR56. The arrows indicate the direc- 
tions of transcription of the amp gene and the pre-F-IF gene. 
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FlG. 2. Construction of plasmids that direct the synthesis of 
pre-F-IF (pLG104R) and mature F-IF (pLGll7R). Plasmid pTR56 
contains the 5' end of the F-IF gene fused to the 3' end of the lacI-Z 
gene and unique HindUl and Bam HI sites 6 and 116 bases upstream 
from the ATG of pre-F-IF. The ATGs encoding the amino terminus 
of pre- and mature F-IF are separated by 60 base pairs. Step 1, the 
plasmid was opened by cleavage with Hindlll. Step 2, the linearized 
plasmid was digested for various times with the exonuclease Bal 31. 
Step 3, the resected plasmids were cut with Bam and religated with 
an excess of a fragment bearing the lac promoter that terminates 5 
base pairs beyond the lac SD sequence. This 105-base-pair portable 
promoter is bounded by Bam and Pvu II ends. It is a derivative of the 
lac 95 portable promoter (see ref. 16). Strain LG90 [F~, A(/ac- 
pro)Xlll] was transformed with the ligation mix and was plated on 
lactose MacConkey indicator agar. Red (lactose-utilizing) colonies 
were picked, and the structures of two plasmids, pLG104 and pLG117, 
were determined. The relevant portions of these two DNA molecules 
are shown in the figure. Step 4, the genes for pre-F-IF and mature F-IF 
were reconstructed. Plasmids pLG104 and pLGll7 were cut with Pst 
I, and the relevant portion was joined with a Pst l-Pst I fragment 
bearing the 3' end of the F-IF gene and the 3' end of amp. The des- 
ignation R indicates the reconstructed F-IF-producing form of the 
plasmids. 
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SD -21 tl 

A - 1 AGGAl AACAG I AC [ATG] CC [ATG1 aGC - 
pLG 104 (pre-F-IF) 

so *i 
- lAGGA| AACAG[CC |ATG| AGC - 
pLG117(F-IF) 

SD H 

B - | AGGAl AACAG CT jATGl ACC - 

lacZ 

FlG. 3. (A) Nucleotide sequences of the DNAs around the regions 
encoding ribosome-binding sites of genes encoding pre-F-IF (in 
pLG104R) and mature F-IF (in pLGH7R). The vertical lines separate 
sequences carried on the lac portable fragment from F-IF sequences. 
The boxes indicate the lacZ SD sequence and the ATGs encoding the 
amino-terminal methionines of pre-F-IF and of mature F-IF. (B) 
Corresponding region of wild-type lacZ (17). 

(Fig. 2C; see also Fig. 3). The DNA containing the kcZ 3' gene 
fragment was removed and replaced with the 3' end of the F-IF 
gene, regenerating intact F-IF (pLG117R) and pre-F-IF 
(pLG104R), in which R indicates the reconstituted F-IF gene 
(Fig. 2D). 

Plasmids that Direct the Synthesis of F-IF(pLG117R) and 
pre-F-IF (pLG104R). Fig. 3A shows the DNA sequence around 
the junctions of the portable promoter and the ATGs encoding 
the amino terminus of pre-F-IF (pLG104 and pLG104R) and 
that of mature F-IF (pLG117 and pLG117R). In each case, the 
SD sequence (AGGA) of lacZ carried on the portable promoter 
has been positioned seven base pairs from the ATG of F-IF. This 
is precisely the distance between the SD sequence and the ATG 
found in wild-type lacZ (Fig. SB) (17). These particular 
placements were rare. In the screening that yielded pLG104, 
lactose-utilizing colonies appeared at a frequency of approxi- 
mately 5%. Plasmid pLG117 was identified in a separate ex- 
periment involving more extensive exonucleolytic digestion. 
In this case, lactose-utilizing colonies were found at a frequency 
of only approximately 0.01%. § 

Proteins Produced by pLG104R and pLG117R. The ex- 
periment of Fig. 4 uses the "maxicell" technique to display those 
proteins encoded by pLG104R and pLG117R. Suitably treated 
maxicells differentially incorporate radioactive amino acids into 
plasmid-encoded proteins that are easily visualized by auto- 
radiography after polyacrylamide gel electrophoresis (11, 18). 
pLG104R and pLGl 17R each direct the synthesis of one protein 
in addition to ^-lactamase. In the case of pLG117R, a protein 
was produced with a molecular weight of approximately 
20,000, consistent with the predicted (from the DNA sequence) 
size of unglycosylated mature F-IF (6). In the case of pLG104R, 
the protein produced had a molecular weight of about 23,000, 
which corresponds to the predicted molecular weight of un- 
glycosylated pre-F-IF. 

Fig. 4 also shows the fate of pulse-labeled pLG104R and 
pLG117R proteins in a maxicell experiment. Densitometry 
tracing of the gel (not shown) reveals that pre-F-IF was com- 



§ This screening also yielded a third fusion, pLGl 15. In this case, the 
portable promoter is abutted to nucleotide 18 of the sequence en- 
coding the F-IF leader some 20 bases from the nearest possible ini- 
tiator triplet (6). Experiments similar to the experiment of Fig. 4 
suggest that, in this case, protein synthesis initiates at the internal ATG 
located at position 175 in the DNA sequence published by Taniguchi 
et al. (6, 7). This ATG is fortuitously preceded by an SD-like sequence 
in the F-IF gene. We do not understand how the portable promoter 
placement in pLG115 enhances the efficiency of utilization of this 
ATG. 




FIG. 4. Proteins encoded by plasmids pLG104R (pre-F-IF) and 
PLG117R (F-IF). Plasmid proteins were labeled by the maxicell 
technique (18). Proteins encoded by pLG104R were labeled for 5 min 
with [^S] methionine (lane 1) and then chased with nonradioactive 
methionine for 12 min (lane 2) or 50 min (lane 3). Labeling and chasing 
were terminated by freezing the cells. After spinning for 2 min in an 
Eppendorf centrifuge, the cell pellets were suspended in Laemmli's 
sample buffer (13), incubated for 3 min at 100°C, then subjected to 
analysis by polyacrylamide gel electrophoresis as described (13). 
Similarly, proteins encoded by pLG117R were labeled for 5 min (lane 
4) and chased for 12 min (lane 5) or 50 min (lane 6). Lanes 7, 8, and 
9 display the same labeling protocol performed on plasmid pGLlOl, 
which encodes only ^-lactamase. Plasmid pLG302-2R, which directs 
synthesis of rabbit /?-globin (5000-7500 molecules per cell) (11) was 
likewise analyzed as shown in lanes 10, 11, and 12. 

pletely degraded in a 50-min chase (lanes 1 and 3), and F-IF 
was about 50% degraded in that time (lanes 4 and 6). Although 
processing of /3-lactamase from pre-/8-lactamase is evident, 
pre-F-IF is apparently not processed. 

We estimated the level of our F-IF and pre-F-IF synthesis 
in two ways. First, we measured the amount of radioactivity 
incorporated during a 5-min pulse into F-IF and pre-F-IF in 
a maxicell experiment. We compared these values with a known 
standard, namely, rabbit /3-globin synthesized by the plasmid 
pLG302-2R (11) (see Fig. 4). This comparison suggests that, 
were the F-IF molecules stable, the steady-state levels would 
be 5000-10,000 molecules per cell. Second, we found that 
growing cells bearing plasmids pLG117 and pLG104 synthesize 
about 1200-1400 units of /3-galactosidase (19). Assuming that 
the hybrid F-IF (or pre-F-IF)-/3-galactosidase molecules have 
the same specific activity as /3-galactosidase, this value repre- 
sents 5000-10,000 molecules per cell (11). Previous experience 
with /3-galactosidase hybrid proteins modified at their amino 
termini suggests that they are stable during growth (20). 

Antiviral Activity of the Human Fibroblast Interferon 
Polypeptide Synthesized in E. coli. Extracts of bacteria 
bearing plasmid pLG117R inhibited growth of vesicular sto- 
matitus virus on human fibroblast cells in a typical interferon 
assay (inhibition of cytopathic effect) (2, 21). This activity was 
abolished by antibody to F-IF but not by antibody to leukocyte 
interferon (Fig. 5). Moreover, extracts of bacteria carrying 
pLG104R, pLG115R, and pBR322 failed to manifest antiviral 
activity. The antiviral activity directed by pLG117R survives 
a sojourn at low pH (pH 2.0) or treatment with DNase and 
RNase, but it is abolished by trypsin treatment (data not shown). 
Assuming that unglycosylated F-IF is as active as glycosylated 
F-IF [2 X 10 8 units/mg (22)], the activity we typically recov- 
ered would correspond to approximately 50 interferon mole- 
cules per bacterial cell. 

DISCUSSION 

Our results strongly suggest that our application of the method 
of Guarente et al. (11) to the F-IF gene isolated by Taniguchi 
et al. (6, 7) has produced two plasmids (pLG104R and 
pLG117R) that direct the synthesis of proteins whose primary 
sequences correspond, respectively, to those of pre-F-IF and 
F-IF. 
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'X«T'*' ^ vS ! s We have only hints as to why pre-F-IF, synthesis of which 

t ™ J is directed by pLG104R, is totally inactive in our assay. It is 

possible that the unprocessed form is inherently inactive. The 
^ # O 0" It O maxicell experiments show no indication that pre-F-IF is cor- 

rectly processed and suggest that it is hyperlabile compared to 
F-IF. Oxender et al. (23) have described a case (the leucine- 
specific binding protein) in which the mature form is less sen- 
sitive to proteolytic degradation than is the precursor bearing 
a hydrophobic leader. It is possible that pre-FTlF is exported 
to the periplasm with or without concomitant cleavage of its 
leader and is rapidly destroyed there. 



Fig. 5. Characterization of antiviral action of bacterially pro- 
duced F-EP as assayed in vitro. Extracts of bacteria bearing pLGll7R 
were added in 1:2 dilutions to human fibroblast cells (FS-7) growing 
in microtiter dishes. In two cases, these extracts were pretreated 
separately with antibody to F-IF and with antibody to leukocyte in- 
terferon (Le-IF). The treated cells were challenged with vesicular 
stomatitus vims and stained with crystal violet (2). Wells containing 
cells uninfected with virus or protected against viral infection stain 
darkly with this dye. Also shown are the effects of no extract and an 
extract of a strain bearing plasmid pBR322. 

These plasmids bear a lac portable promoter abutted to the 
ATG encoding the amino-terminal methionine of pre-F-IF 
(pLG104R) and to the ATG encoding the amino-terminal 
methionine of F-IF (pLG117R). In each case, these promoter 
placements were originally recognized by their abilities to ef- 
ficiently direct synthesis of a F-IF-/3-galactosidase hybrid 
protein. In each case, the distance separating the SD sequence 
of the lac promoter from the ATG is precisely that found in the' 
case of wild-type lacZ. When compared with pBR322, 
pLG117R and pLG104R each direct synthesis of one new 
protein of molecular weights approximately 20,000 and 23,000, 
respectively. These are the sizes expected for unglycosylated 
proteins with the primary sequences of F-IF and pre-F-IF as 
predicted from the DNA sequence of Taniguchi et al. (6). In 
previous cases, we have found that formation of such hybrid 
ribosome-binding sites, not dissimilar to the ones shown here, 
have directed correct initiation of protein synthesis (rabbit 
0-globin, simian virus 40 tumor (t) antigen, A repressor) as de- 
termined by direct amino acid sequencing (8-1 1 ). In all of these 
cases, the amino-terminal methionine was maintained. We have 
not determined the amino acid sequence of our bacterially 
produced F-IF. 

Plasmid pLG117R, but no other plasmid described here, 
directs antiviral activity characteristic of F-IF under our assay 
condition. The amount of this activity is much lower (only about 
1%) than that predicted on the basis of the rate at which the 
protein is synthesized in our bacteria. We imagine the following 
possible explanations for this difference. 

(i) The protein is rapidly degraded. The pulse-chase ex- 
periment of Fig. 5 indicates that the bacterially produced F-IF 
protein is somewhat unstable under the particular conditions 
of that experiment. But this degree of instability would not 
account for the difference between the expected and the ob- 
served result. The conditions under which we visualized the 
proteins (i.e., in maxicells) may not accurately reflect the extent 
of degradation in growing cells. 

(ii) Bacterially synthesized F-IF, which is unglycosylated, 
may have low specific activity in our in vitro assay. 

(Hi) Our method of extraction may not efficiently recover 
active F-IF. We have not systematically varied growth condi- 
tions or methods of extraction. 
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